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PREFACE 


The title of this book is the best brief description of its content and purposes 
that the author was able to think of. These purposes are rather different from those 
of most books on “higher geometry” or the foundations of geometry. The difference 
that is involved here is somewhat like the difference between the two types of 
advanced calculus courses now commonly taught. Some courses in advanced cal- 
culus teach material which has not appeared in the preceding courses at all. There 
are others which might more accurately be called courses in elementary calculus 
from an advanced standpoint: their purpose is to clean up behind the introductory 
courses, furnishing valid definitions and valid proofs for concepts and theorems 
which were already known, at least in some sense and in some form. One of the 
purposes of the present book is to reexamine elementary geometry in the same 
spirit. 

If we grant that elementary geometry deserves to be thoroughly understood, 
then it is plain that such a job needs to be done; and nc such job is done in any 
college course now widely taught. The usual senior-level courses in higher geometry 
proceed on the very doubtful assumption that the foundations are well under- 
stood. And courses in the foundations (when they are taught. at all) are usually 
based on such delicate postulate sets, and move so slowly, that they cover little 
of the substance of the theory. The upshot of this is that mathematics students 
commonly leave college with an understanding of elementary geometry which is 
not much better than the understanding that they acquired in high school. 

The purpose of this book is to elucidate, as thoroughly as possible, both this 
elementary material and its surrounding folklore. My own experience, in teach- 
ing the course to good classes, indicates that it is not safe to presuppose an exact 
knowledge of anything. Moreover, the style and the language of traditional 
geometry courses are rather incongruous with the style and the language of the rest 
of mathematics today. This means that ideas which are, essentially, well under- 
stood may need to be reformulated before we proceed. (See, for example, Chapter 6, 
on Congruences Between Triangles.) In some cases, the reasons for reformula- 
tion are more compelling. For example, the theory of geometric inequalities is 
used in the chapter on hyperbolic geometry; and this would hardly be reasonable 
if the student had not seen these theorems proved without the use of the Euclid- 


ean parallel postulate. 
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For these reasons, the book begins at the beginning. Some of the chapters are 
quite easy, for a strong class, and can simply be assigned as outside reading. Others, 
such as Chapters 20 and 24, are more difficult. These differences are due to the 
nature of the material: it is not always possible to get from one place to another 
by walking along a path of constant slope. 

This unevenness in level of difficulty makes the book rather flexible. A well- 
prepared class may go rapidly through Chapters 1-7, and devote most of its time 
to the sort of material presented in Chapters 8, 10, 14, 19, and 20. A poorly pre- 
pared class may go carefully through Chapters 1-7, omit such chapters as 14 
and 20, and still not get to Chapter 25. 

The book is virtually self-contained. The necessary fragments of algebra and 
the theory of numbers are presented in Chapters 29 and 30, at the end. At many 
points, ideas from algebra and analysis are needed in the discussion of the geometry. 
These ideas are explained in full, on the ground that it is easier to skip explana- 
tions of things that are known than to find convenient and readable references. 
The only exception to this is in Chapter 25, where it seemed safe to assume that 
epsilon-delta limits are understood. 

In some chapters, especially Chapters 20 and 25, we give full expositions of 
topies which are commonly dismissed rather briefly and almost casually. The 
process by which the real numbers are introduced into an Archimedean geometry, 
for purposes of measurement, is highly important and far from trivial. The same 
is true of the consistency proof for the hyperbolic postulates. Here (as elsewhere) 
the purpose of this book is to explain, with all possible lucidity and thoroughness, 
ideas which are widely alluded to but not so widely understood. 


Cambridge, Mass. E. E. M. 
October 1962 


p—— 


CONTENTS 


———e— 


Chapter 1. THE ALGEBRA OF THE REAL NUMBERS 


1.1 Introduction ; > 1 
1.2 Addition and multiplication of real numbers 3 1 
13 Fields . MID 9 
14 The ordering of the real numbers 10 
1.5 Order relations and ordered fields z 18 
1.6 The positive integers and the induction principle, 16 
1.7 The integers and the rational numbers. . 22 
1.8 The Archimedean postulate; Euclidean completeness. 24 
1.9 The language and notation of sets . . 28 
1.10 n-fold sums and products; the generalized associative lw 29 
1.11 The closure of the integers under addition and multiplication 33 
Chapter 2. INCIDENCE GEOMETRY IN PLANES AND SPACE. 36 
Chapter 3. DISTANCE AND CONGRUENCE 
3.1 The idea of a function F 40 
3.2 The set-theoretic interpretation of fapetions and relations 7 44 
3.3 The distance function $ T ait 47 
3.4 Betweenness . 50 
3.5 Segments, rays, angles, A MEC LI 54 
3.6 Congruence of segments . 58 
Chapter 4. SEPARATION IN PLANES AND SPACE 
41 Convexity and separation qs xo EUM xS 0L 
4.2 Incidence theorems based on the planeseparation postulate da N EE CR 
4.3 Incidence theorems, continued . : 67 
4.4 Convex quadrilaterals x 70 
4.5 Separation of space by planes 71 
Chapter 5. ANGULAR MEASURE 
5.1 Degree measure for angles 74 
Chapter 6. CONGRUENCES BETWEEN TRIANGLES 
6.1 The idea of a congruence 3 er, 81 
6.2 The basic congruence theorems . 85 
6.3 Some remarks on terminology . 90 
6.4 The independence of the SAS postulate 91 
6.5 Existence of perpendiculars . " 93 
Chapter 7. GEOMETRIC INEQUALITIES . 95 
Chapter 8. THE EUCLIDEAN PROGRAM: donati WITHOUT DISTANCE 
8.1 The synthetic postulates — . . 102 
8.2 Right angles, synthetically considered. . Qa MT ee co take 
8.3 Geometric inequalities, synthetically considered . 107 


vii 


viii CONTENTS 


8.4 Proofs of the laws of inequality for segments . 
8.5 Asummary of the differences between the metric and ayutbetio Sppronction 


Chapter 9. THREE GEOMETRIES 
9.1 Introduction 3 
9.2 The Poincaré model for Lobachevekian. geometry 
9.3 The spherical model for Riemannian geometry 
9.4 Some questions for later investigation . 


Chapter 10. ABSOLUTE PLANE GEOMETRY 
10.1 Sufficient conditions for parallelism 
10.2 The polygonal inequality 
10.3 Saccheri quadrilaterals : 
10.4 The basic inequality for angle-sums i ina rangle , 
10.5 A historical comedy 


Chapter 11. THE PARALLEL POSTULATE AND PARALLEL PROJECTION 
11.1 The uniqueness of parallels . 
11.2 Parallel projections 
113 The comparison theorem . 
11.4 The basic similarity theorem 


Chapter 12. SIMILARITIES BETWEEN TRIANGLES 
12.1 Proportionalities 
12.2 Similarities between triangles: 
12.3 The Pythagorean theorem 


Chapter 13. POLYGONAL REGIONS AND THEIR AREAS 
13.1 The area postulates 
13.2 Area theorems for triangles and quadiiatersie. 
13.3 Applications of area theory: a simple proof of the basic aishilavity theorem 
13.4 Further applications of area theory: the die mers theorem . 
13.5 A weaker form of the unit postulate A-5 . d 


Chapter 14. THE CONSTRUCTION OF AN AREA FUNCTION 
14.1 The problem y 
14.2 Complexes and their formule: areas. . 
14.3 Verification of the area postulates for the function a 


Chapter 15. PERPENDICULAR LINES AND PLANES IN SPACE 
15.1 The basic theorems 
15.2 Parallel lines and planes in apane 3 
15.3 The measure of a dihedral angle: perpeadioular lines. 


Chapter 16. CIRCLES AND SPHERES 
16.1 Basic definitions x 
16.2 Secant and tangent lines. "The line-cirele theorem 
16.3 Secant and tangent ree 4 
16.4 Ares of circles . 
16.5 The two-circle theorem 


Chapter 17. RIGID MOTION 
17.1 The most general concept of congruence 
17.2 Isometries between triangles . 


110 
112 


114 
115 


CONTENTS 


Chapter 18. CARTESIAN COORDINATE SYSTEMS 
18.1 Coordinates, equations, and graphs 


Chapter 19. CONSTRUCTIONS WITH RULER AND COMPASS 
19.1 Unmarked rulers and collapsible compasses 
19.2 How to do algebra with ruler and compass 
19.3 Solving equations with ruler and compass . 
19.4 The problem of Apollonius 
19.5 The impossible construction problema of antiquity 
19.6 The surd field . HN a 
19.7 The surd plane . takaria peta yo «E o 
19.8 Quadratie extension of fields. Conjugates in a quadratic extension field 
19.9 Surd fields of order n, surds of order n . . ^i rcli ees 
19.10 Applications to cubic equations with rational coefficients 
19.11 The trisection of the angle > Aaa 
19.12 The duplication of the cube . 


Chapter 20. FROM EUDOXUS TO DEDEKIND 
20.1 Proportionalities without numbers . 4 
20.2 Eudoxus' synthetic definition of proportionality : 
20.3 The algebra of segment addition a 
20.4 How to define ratios: the supremum 
20.5 Proof of the addition theorem 
20.6 The metrization theorem 
20.7 The Dedekind cut . 


Chapter 21. LENGTH AND PLANE AREA 
21.1 The definition of arc length . 
21.2 The existence of r à 
21.3 Limits as the mesh approaches se zero 
21.4 The addition theorem for arc length 
21.5 Approaching the area of a circular sector from. below, 
21.6 Approaching the area of a circular sector from above 
21.7 The area of a sector 


Chapter 22. JORDAN MEASURE IN THE PLANE 
22.1 The basic definition e a: , 
22.2 The class of measurable sets . i 
29.3 Areas under the graphs of continuous functions 


ix 


208 


x CONTENTS 


24.3 Hyperbolic geometry: closed triangles and angle sums . . 317 
244 Hyperbolic geometry: the defect of a triangle and the — of [similarity 
theory . . ^ 321 
24.5 Absolute geometry: triangulations and subdivisions UNS eee eee 
24.6 Euclidean geometry: Bolyai's theorem . . te oe oe eee 
24.7 Hyperbolic area-theory: the defect of a polygonal | region ved eom deem cot 
24.8 Bolyai's theorem, for triangles, in the hyperbolic case . - - - + - 334 
249 Defectsofsmalltriangles - -s - - - > c c c oc ttt M 337 
24.10 The continuity of the defect . . iere 
24.11 Bolyai's theorem for poly gonal — in the Byperbolie: DM Sn Ae 14, Oe 
24.12 The impossibility of Euclidean area-theory in hy, gata ore. . . 943 | 
24.13 The uniqueness of hyperbolic area-theory . . . - . . 945 | 
24.14 Alternative forms of the parallel postulate . . . UE DEO 
Chapter 25. THE CONSISTENCY OF THE HYPERBOLIC POSTULATES 
25.1 Introduction . . .. 948 
25.2 Inversions of a punctured: plane AT De ee ee. 
25.3 Preservation of the cross ratio under inversions . . . . - . - + 352 
254 Preservation of angular measure under inversions . . . . . . . 353 
25.5 Reflections across L-lines in the Poincaré model . . . . . . . - 397 
25.6 Uniqueness of the L-line through two points . - res d. e e oae: | 
25.7 The ruler postulate, betweenness and plane separation se re S SCA 
25.8 Angular measure in the Poincaré model . . mie ec Se ow AST 
25.9 The SAS postulate . . . f DT Ls ERGO 
Chapter 26. THE Poner OF EUCLIDEAN GEOMETRY 
26.1 Introduction . . - e M CONIMST | 
Dep "rfeiruler'postul&te- 9-4. RE T S LR ee S. i O08, 
26.3 Incidence and parallelism . - - - - s + + + c c0 oc o7 7 369 | 
26.4 Translations and rotations... - - + + + + + ot ot on 7 371 
38:5. V'Plane:séparation - 1-540. sage AS e duo STE E. cM LE a 375 | 
26.6 Angle congruences . . LA US NO. SRE, BRP BOTT 
Chapter 27. THE POSTULATIONAL METHOD 
27.1 Introduction . i ow aS oO erp EE ess 
27.2 Postulates considered à as seli-evident trtiths WAAN, ROSE GS actqeness 
27.3 Categoric postulate sets . - SOA AMG Te T, 1,5 SOS 
27.4 The use of postulate sets as codification i. got XN TT, X38 388 
27.5 The use of postulates as a matter of convenience. . . - . ee 887 
Chapter 28. AN EXAMPLE OF AN ORDERED FIELD WHICH IS NOT 
ARCHIMEDEAN.. . - TE . . 888 
Chapter 29. THE THEORY OF NUMBERS oan sons E res Ie alien DATE 
Chapter 30. THE THEORY OF EQUATIONS i o een o6 mth V #5 400 
Chapter 31. LIMITS OF SEQUENCES 
31.1 The definition of a limit for sequences . . . up aen cite magia 
Chapter 32. COUNTABLE AND UNCOUNTABLE SETS . . . . . . . 408 


Hupék 00. ey INDIA teft 900 ota SEN ORC Unt. Sosa TEIN HA RIS 


CHAPTER ] 


THE ALGEBRA OF THE REAL NUMBERS 


A ———————— 


1.1 INTRODUCTION 


In this book, we shall be concerned mainly with geometry. We shall begin at 
the beginning and base all our work on postulates which will be carefully stated. 
Our postulates for geometry will be somewhat different from the ones that you 
are familiar with from elementary courses, but the general way in which we use 
them will be the familiar way which has been used in geometry ever since Euclid. 
The scheme (to put it briefly) is to state at the outset what we are assuming and 
base our later conclusions on these explicitly stated assumptions. 

In modern mathematics, however, geometry does not stand alone the way that 
it did in the time of Euclid. In modern mathematics, the real number system plays 
a central role, and geometry is far easier to deal with and to understand if real 
numbers are allowed to play their natural part. For this reason, the first step in 
our program is to put the real number system on the same solid basis that we in- 
tend to provide for geometry, by first stating our assumptions clearly and then 
building on them. 


1.2 ADDITION AND MULTIPLICATION OF REAL NUMBERS 


We shall think of the real numbers as being arranged as points on a line like 
this: 


Figure 1.1 


The real numbers include, at least, all of the following: 


(1) the positive integers 1, 2, 3,..., and so on; 
(2) the integer 0; 
(3) the negative integers —1, —2, —3,..., and so on; 
(4) the fractions with integers as numerators and integers different from 0 as 
denominators. For example, 4, —8 and 1,000,000/1 are numbers of this type. 
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Note that this fourth kind of number includes the first three, because every 
integer n is equal to n/1. What we have so far, then, are the numbers of the form 
p/q, where p and q are integers and q is not 0. These are called the rational numbers. 
This term is not meant to suggest that any other kind of number must be crazy. 
It merely refers to the fact that a rational number is the ratio of two integers. 
As you probably know, there are many real numbers that are not of this type. 
For example, /2 is not the ratio of any two integers. Such numbers are called 
irrational. 

We proceed to state the basic properties of the real number system in the form 
of postulates. We have given a set R, whose elements are called real numbers 
(or simply numbers, if the context makes it clear what is meant). We have given 
two operations, addition and multiplication, denoted by + and -. Thus the alge- 
braic structure that we are dealing with is a triplet 


[R, +; d 
The properties of the system are as follows: 


A-1. R is closed under addition. That is, if a and b belong to R, then a + b 
also belongs to R. 


A-2. Addition in R is associative. That is, if a, b, and c belong to R, then 
a+(b+c) = (a 4-0) 4- c. 
A-3. There is exactly one element of R, denoted by 0, such that 


3 a+0=0+a=a 
for every a in R. 
A-4. For every a in R there is exactly one number —a in R, called the negative 


of a, such that 4 
a+ (—a) = (—a) + a = 0. 


A-5. Addition in R is commutative. That is, if a and b belong to R, then 
a+b=b+a. 


These postulates are numbered A-1 through A-5 because they are the postulates 
that deal with addition. We now move on to multiplication. 


M-1. R is closed under multiplication. That is, if a and b belong to R, then ab 
belongs to R. 


(Here and hereafter, we denote the product a+b simply as ab. This is a matter 
of convenience, and we shall not do it consistently. For example, when we write 
26, we mean twenty-six, not twelve.) 


M-2. Multiplication in R is associative. That is, if a, b, and c belong to R, 
then 
a(bc) = (ab)c. 
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M-3. There is exactly one element of R, denoted by 1, such that 


a al = la = a 
for every a in R. 


M-4. For every a in R, other than 0, there is exactly one number a^, called the 
reciprocal of a, such that 


M-5. Multiplication in R is commutative. That is, if a and b belong to R, 
then 
ab = ba. 

M-6. 1 is different from 0. 

This postulate may look peculiar, but it is necessary. Under the preceding 
postulates, we have no guarantee that R contains any number at all except 0. 

The postulates, so far, have dealt with addition and multiplication separately. 
These two operations are connected by the following postulate. 


AM-1. The Distributive Law. If a, b, and c belong to R, then 
a(b + c) = ab + ac. 


In addition to these postulates, you may feel the need for the following two 
statements: 


E-1. If 
a=b and e= d 
then 
a+c=b+d. 
E-2. If 
a=b and c=d, 
then 


ac = bd. 


Here it should be understood that a, b, c and d belong to R. But these statements 
are not really postulates for the real number system. They merely serve to remind 
us of what addition and multiplication are all about. The first one says that the 
sum of two numbers depends only on the numbers, and does not depend on the 
letters that we happen to use to denote the numbers; similarly for the second 
“law” E-2. 

Throughout this book, the symbol “=” will always mean "is the same as.” 
As usual, the symbol “#” means “is different from.” 

Subtraction is defined by means of the negatives given by A-4. That is, 


a — b = a + (—b), 
by definition. Similarly, division is defined by means of the reciprocals given by 
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M-4. Thus, if b # 0, then 
f= atb= ab", 
by definition. 
From the above postulates, all of the usual laws governing addition and multipli- 
cation can be derived. We start the process as follows. 


Theorem 1. a0 = 0 for every a. 
Proof. By A-3, we have 


L=1+0. 
Therefore 
: a-:1 = a(l + 0). 
Hence 
a = al + al, 
a = a + a0, 
(—a) + a = (—a) + (a + a0), 
0 = [(—a) + a] + a0, 
0 = 0+ a0, 
and 
0 = a0, 


which was to be proved. (You should be able to give the reasons for each step by 
citing the appropriate postulates.) 


Theorem 2. If ab = 0, then either a = 0 or b = 0. 


Proof. Given ab = 0. We need to show that if a ~ 0, then b = 0. Ifa # 0, 
then a has a reciprocal a~*. Therefore 
a-'(ab) = a 10 
=0. 
But 
a~'(ab) = (a~‘a)b 
1b = b. 


Thus b = 0, which was to be proved. 
This, of course, is the theorem that you use when you solve equations by 


factoring. If 
(z — I) — 2) = 0, 
then either z = 1 or z = 2, because the product of the numbers z — 1 andz — 2 


cannot be 0 unless one of the factors z — 1 and z — 2is0. You need this prin- 


ciple to be sure that nobody is going to find an extra root by investigating the 
equation by some other method. 


Theorem 3. 0 has no reciprocal. That is, there is no number z such that 0z = L 
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Proof. We know that 0x = 0 for every x. If we had Ox = 1 for some z, it would 
follow that 0 = 1. This is impossible, because M-6 tells us that 0 ¥ 1. 

This theorem gives the reason why division by 0 is impossible. If division by 
0 meant anything, it would mean multiplication by the “reciprocal of 0.” Since 
there is no such reciprocal, there is no such operation as division by 0. 


Theorem 4. The Cancellation Law of Addition. If a + b = a + c, then b — c. 


Proof. If 
at+tb=a+te, 
then 
(—a) + (a + b) = (—a) + (a + €), 
[(—a) + a] + b = [(—a) +a] + e, 
0+b=0+¢, 
and 
bees 


Theorem 5. The Cancellation Law of Multiplication. If ab = ac, and a # 0, 
then b = c. 


Proof. If ab = ac, anda # 0, then a has a reciprocal a^ !. Therefore 
a^ !(ab) = a^ (ac), 
(a—'a)b = (a-a)e, 


1b =le, 


and 
b=. 


Theorem 6. —(—a) = a, for every a. 


Proof. By definition of the negative, the number —(—a) is the number z such 


that 
(—a) +z =z + (—2) = 0. 


The number a has this property, because 
(—a) -- a= a+ (—a) = 0. 
But A-4 tells us that every number has exactly one negative. Therefore a is the 
negative of —a, which was to be proved. 
Theorem 7. (—a)b — — (ab), for every a and b. 
Proof. What we need to show is that 
(—a)b + ab = ab + (—a)b = 0, 
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because this is what it means to say that (—a)b is the negative of ab. By the com- 
mutative law, it will be sufficient to show that 
(—a)b + ab = 0. 
By the distributive law, 
(—a)b + ab = [(—a) + alb. 
Since (—a) + a = 0, and 0b = 0, we have 
(—a)b + ab = 0, 
which was to be proved. 
This theorem gives the “rule of signs” under which (—2) - 4 = —8, (—7) -4 = 
—28, and so on. 
Theorem 8. (—a)(—b) = ab for every a and b. 


Proof. 
(—a)(—b) = —[a(—5)] 


I oil 
lz 4 
ies 
L3 

~ 
8S E 
© A 


(What is the reason for each step?) 
This, of course, is the second “rule of signs,” which tells us that (—3)(—4) = 12, 
and so on. 


Theorem 9. The reciprocal of the product is the product of the reciprocals. 
That is, 
(ab)! — a b^! 
for every a »* 0, b # 0. 
Proof. We need to show that 


(ab)(a b^) = 1. 
Now 

(ab)(a~"b~") = a[b(a-!b-!)] 
= alib tat) 
= a{(bb~)a~"} 
= a{la~'] 
= aa^! 
zs]. 


Theorem 10. The negative of the sum is the sum of the negatives. That is, 
—(a+ b) = (—a) + (—0). 
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Proof. We need to show that 
(a + b) + [(—a) + (—b)] = 0. 


To avoid an excessive accumulation of parentheses, let us agree, in this proof only, 
to denote —z by x’. We then have 


(a + b) + [(—a) + (—b)] = (a + b) + (a' + b’) 
a+ [b+ (a’ + b’)] 
=a + [b+ (b 4- a^)] 
= a + [(b + b) +a) 
= a + [0 + a] 
=a+a 

= 0. 


ll 


Note that this proof is precisely analogous to the proof of the preceding theorem. 

Obviously, we could go on proving theorems like this indefinitely. In fact, if 
you stop to think, you will realize that nearly every time you have performed an 
algebraic calculation, you have in effect proved a theorem of this sort. For example, 
when you factor x? — a°, and get (x — a)(x + a), you are elaiming that the 
following theorem holds. 


Theorem 11. For every x, a, we have 


(x — a)(x + a) = x? — a?. 


Proof? 


An equation which holds for all real numbers is called an algebraic identity. 
Stated in this language, the two associative laws say that the equations 


a+ (b+c) = (a 4- b) 4- c, 
a(bc) = (ab)c, 


are algebraic identities; the distributive law says that the equation 
a(b +c) = ab + ac 


is an algebraic identity, and so on. 

In the following exercises, it is permissible to use the two associative laws with- 
out comment. In fact, since it doesn’t matter how the terms or factors are grouped, 
we don’t need to indicate a grouping at all; we can write a + b + c to denote 
a -+ (b + c) or (a + b) + c, and similarly for multiplication. We can do the same 
for n-fold products of the form aiaz... an, although the justification for this is 
more complicated than you might think. 

As usual, a? means aa, a? means aaa, and so on. Similarly, 2 means 1 + 1, 
3 means 2+ 1 = 1 + 1+ 1, and so on. 
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PROBLEM SET 1.2 


Show that the following equations are algebraic identities. All these statements are 
to be regarded as theorems, and should be proved on the basis of the postulates and the 
theorems that have previously been proved. A reason should be given for each step in 
the proofs. 

l. b(—a) = —(ab) 
2. (—a)(—b) = ba 
3. a(b 4- c) = ca + ba 
4. a(b — c) = ab — ac 
5. —0 = 0 
6.a—0-2a 
7. ab = ba? (Try to get a very short proof.) 
8.a+a = 2a 
9. (—a) + (—a) = (—2)a 
10. a?(b? + c?) = a?b? + a?e? 
11. a?(b? — c?) = —a2c? + a?b? 
12. (a+ b)(c + d) = ac + be + ad + bd 
13. (a + b)? = a? + 2ab + b? 


The following are discussion questions. 


14. Suppose that subtraction is regarded as an operation. Does this operation obey 
the associative law? That is, is the equation (a — b) — c = a — (b — c) an algebraic 
identity? If not, under what condition does the equation hold? (It is not necessary to 
answer this question on the basis of the postulates; you are free to use all the algebra 
that you know.) 

15. Suppose that division is regarded as an operation. Does this operation obey the 
associative law? That is, is the equation, (a/b)/c = a/(b/c), an algebraic identity? 
If not, under what conditions does the equation hold? 

16. Does subtraction obey the commutative law? How about division? 

The answers to the preceding three questions indicate why we do not regard subtrac- 
tion and division as basic operations when we are formulating the basic properties of the 
real numbers. : 

17. Postulate M-6 (which says that 1 = 0) may seem superfluous. Is it? Can it be 
proved, on the basis of the other postulates, that there is any number at all other than 0? 

18. Suppose that the only elements of R were 0 and 1 with addition and multiplication 
defined by the following tables. 


+ 0 1 . 0 1 
0 0 1 0 0 0 
1 1 0 1 0 1 


Which of the postulates would hold true? Which, if any, would fail to hold? 
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1.3 FIELDS 


An algebraic structure satisfying the postulates of the preceding section is 
called a field. Since we have expiained that R denotes the set of real numbers, 
it may be worth while to state the definition of a field over again from the beginning, 
allowing the possibility that the field may not be the real number system. 

Given a set F, of objects called numbers, with two operations + and -, called 
addition and multiplication. The structure 


IF, +] 
is called a field if the following conditions hold. 
A-1. F is closed under addition. 
A-2. Addition in F is associative. 
A-3. F contains exactly one number 0 such that 


a+0=0+a=a 
for each a in F. 


A-4. Every a in F has exactly one negative —a in F such that 
a+ (—a) = (—a) + a = 0. 

A-5. Addition in F is commutative. 

M-1. F is closed under multiplication. 

M-2. Multiplication in F is associative. 


M-3. F contains exactly one element 1, such that al = la = a for every a 


M-4. Every a ~ 0 in F has exactly one reciprocal a^! such that 
aa^! = am 
M-5. Multiplication in F is commutative. 
M-6. 1 + 0. 
AM-1. For every a, b, c in F, we have 
a(b + c) = ab + ac. 


All the theorems of the preceding section were proved merely on the basis of the 
above postulates. It follows that all these theorems hold true not merely in the 
real number system but in any field. For example, they all hold true in the algebraic 
system described in Problem 18 of Problem Set 1.2. 
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Prosiem SET 1.3 


The purpose of this problem set is merely to clarify the meaning of the postulates for 
afield. In answering the questions below, you may make free use of all the algebra which 
in fact you know. In the following section of the text, we shall return to our “official” 
mathematics, based on postulates. 


1. Let F be the set of all numbers of the form p/2*, where p and q are integers and 
q = 0. These numbers are called dyadic rationals. Do the dyadic rationals form a field 
under the usual definition of + and -? Which, if any, of the field postulates fail to hold? 

2. Let F be the set of all complex numbers with absolute value = 1. Does F forrff a 
field, under the usual definitions of + and -? Which, if any, of the field postulates fail to 
hold? (You may assume, of course, that the set of all complex numbers forms a field; 
it does.) 

3. Same question for the set of all positive real numbers. 

4. Same question for the set of all real numbers of the form a + b4/2, where a and b 
are rational. 

5. An algebraic structure [F, +, +] is called a commutative ring with unity if it satisfies 
all of the field postulates except possibly for M-4. Obviously every field is a commutative 
ring with unity, but not every commutative ring with unity is a field. Exactly one of the 
algebraic structures described in the preceding problems forms a commutative ring with 
unity but does not form a field. Which one is it? 1 

6. In the algebra of the real numbers, the following theorem holds. 


Theorem. If aibe — azbı # 0, then the system of equations 
ayz+biy =c, azt + bzy = 63, 


is satisfied by exactly one pair of numbers (z, y). 
Does this theorem hold true in any commutative ring with unity? Does it hold true in 
any field? 

7. Consider a coordinate plane, with points identified by pairs (z, y) of numbers. We 
define the “sum” of two points (u, v) and (z, y) to be the point (u + z, v + y). 

Does this system satisfy A-1 through A-5? Is it possible to define the “product” of 
two points in such a way as to get a field? If so, how? 


1.4 THE ORDERING OF THE REAL NUMBERS 


We remember that the real numbers can be thought of, informally, as being ar- 
ranged on a line like this: 
1 


-V2 =a 
- =$ -1 0 1 2 3 
Ficure 1.2 


When we write a < b, this means, graphically speaking, that a lies to the left of b 
on the number scale. Thus —2 < 1, and —1,000,000 < js, although the former 
number may, in a way, seem “bigger.” 


THE ORDERING OF THE REAL NUMBERS 1 


The laws governing the relation < are as follows. 


O-1. Every pair of real numbers (a, b) satisfies one, and only one, of the con- 
ditions a < b, a = b, b < a. 


O-2. Ifa < band b < c, then a < c. 


The expression a < b is pronounced “a is less than b." When we write b > a, 
this means (by definition) that a < b. When we write 


a S b, 


this means that either a < b or a = b. The relation < is connected up with 
addition and multiplication by the following conditions. 


MO-1. If a > O and b > 0, then ab > 0. 
AO-1. If a < b, thena +c < b + c for every c. 


From these four conditions (together with our other postulates), all of the laws 
governing inequalities can be derived. Let us take some examples. 


Theorem 1. Any two inequalities can be added. That is, if 


a<b 
and 
c « d, 
then 
a+e<b+d. 
Proof. By AO-1, 
a+c<b+c; 
by AO-1, 
b+e<b+d. 


(From now on, we are going to use the commutative law and similar principles 
without comment.) By O-2, this means that 

at+e<b+d, 
which was to be proved. 


Theorem 2. a < bif and only if b — a > 0. 


Proof. If a < b, then a — a < b — a, by AO-I. Therefore b — a » 0. 
Conversely, if b — a > 0, then b — a+a> a, and b >a. 


Theorem 3. An inequality is preserved if we multiply both sides by the same 
positive number. That is, if 
a « b, 
and 
€ 0; 
then 
ac < be. 
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Proof. Since a < b, we have 


b—a>O0. 
Therefore 
c(b — a) > 0, 
by MO-1. Hence 
bc — ac » 0, 


by Theorem 2; this means that ac < be, which was to be proved. 


Theorem 4. If a > 0, —a « 0. 


Proof. Ifa > 0, thena — a > 0 — a, by AO-1. Thus 0 > —a,and —a < 0, 
which was to be proved. 


Theorem 5. If a < 0, then —a > 0. 


Proof. If a < 0, then 0 — a > 0, by Theorem 2. Therefore —a > 0, which 
was to be proved. 


Theorem 6. An inequality is reversed if we multiply both sides by the same nega- 
tive number. That is, if 


a « b, 
and 
60 
then 
ac > be. 
Proof. Ifa < b, then 
b—a>0, 
by Theorem 2. If 
e « 0; 
then 
6 o; 
by Theorem 5. "Therefore 
—be 4- ac > 0, 
by MO-1. Therefore 
ac > be, 


by Theorem 2. 
Suppose that we have an inequality involving an unknown number z, such as 
2r — 5 < 7x +3. 


Every number z either satisfies the inequality or doesn't. For example, r — 1 
satisfies the inequality, because —3 « 10; but z — —2 does not, because 
—9 > —1l. An expression of this sort, involving a letter for which we can 
substitute anything we want, is called an open sentence. When we substitute 
1 for x, we get the statement —3 < 10, which is true. When we substitute —2 
for x, we get the statement —9 < —11, which is false. The set of all numbers 


— —P:22s: 
eee 
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which give true statements when substituted for z is called the solution set of the 
open sentence. Here are a few examples. 


Open sentence Solution set 


z+2=7 {5} 
z+0=2 R 
z2—4-—0 (2, —2} 
:-222-c-x R 


In the column on the right, (5) denotes the set whose only element is 5, and 
(2, —2) denotes the set whose elements are 2 and —2. The same notation is used 
whenever we want to describe a finite set by giving a complete list of its elements. 


For example, (1, 3,5, 7,9) 


is the set of all positive odd integers less than 10. The curly brackets are used in 
describing sets, rather than sequences, and so the order in which the elements are 
listed makes no difference. For example, 


{1, 3,5, 7,9} = {7, 3, 9, 1, 5}; 


the sets described are exactly the same. 

It sometimes happens, of course, that an open sentence never becomes a true 
statement, no matter what you substitute for z. For example, the equation 
(x + 1)? = x? + 2-2 has no roots at all. In this case, the solution set is the 
empty set, that is, the set that has no elements at all. The empty set is denoted 
by 6, to avoid confusion with the number 0. Here are more examples: 


Open sentence Solution set 
a? + 2x +2 = (x + 1)? 9 
22-0 {0} 
<z 9 
20r — 0 {0} 
There is a short notation for the solution set of an open sentence. When we 
it 
Nox t? = 0}, 


this means the set of all real numbers z such that z? = 0. Thus 
(zz? = 0} = {0}, 


(zz? — 52 + 6 = 0} = {2,3}, 
and so on. 
To “solve” an equation or an inequality means to find the solution set of the 
corresponding open sentence. For inequalities, the “answer” usually takes the form 
of a second open sentence which is simpler and easier to interpret than the first. 
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The simplification process might look like this: If 


(1) 2z — 5 < 7z 4- 8, 
then by AO-1 we have 
(2) —5e — 5 < 3, 
and 
(3) —5r < 8. 
By Theorem 6, we have 
(4) z> —$. 


(We have multiplied by the negative number —1.) Therefore every number x which 
satisfies (1) also satisfies (4). Conversely if (4) holds, then so does (3); if (3) holds, 
then so does (2), and if (2) holds, then so does (1). The inequalities (1) and (4) 
are called equivalent; by this we mean that every number which satisfies one of 
them also satisfies the other. Expression (4) is called the solution of (1). The 
process that we have gone through can be written in an abbreviated form as 
follows: 
2r — 5 < 7z + 3, 
€ —5e — 5 <3, 
€ —5z < 8, 
=2z> —§. 


The double-headed arrow on the left should be pronounced “is equivalent to”; 


when we write 
27 — 5 < Tr +3 er> —4, 


we mean that the open sentences connected by the symbol €» have exactly the 
same solution set. The advantage of the abbreviation is that it makes it easy to 
write at each stage exactly what we have on our minds. (When we write long strings 
of formulas, it is not always easy to tell or to remember what the logical connec- 
tion between them is supposed to be.) The result of our work on this little problem 
can be written as follows: 


{z|2e — 5 < 7z —3) = {ale > —§}. 


We use a single-headed arrow to indicate that one condition implies another. 
For instance, when we write 


12222?» 4, 


we mean that if z > 2, then z? > 4. This is true, because if z > 2, then x? > 2z, 
;by Theorem 3. Also by Theorem 3, if z > 2, then 2z > 4. By O-2, z? > 4, 
which was to be proved. 

` Note that it is not true that 


z»2e62*4, 


because any number less than —2 satisfies the second inequality but not the first. 
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(As one step in the proof above, we showed that x > 2-2 x? > 2x. Is it true 
that x? > 2x 2 z > 2? Why or why not?) 
The absolute value of a number « is denoted by |x|. It is defined by the following 


two conditions. 
(1) Ifz = O, then |r| = z. 


(2) Ifx < 0, then |z| = —z. 

For example, |2| = 2, because 2 2 0, and |-2| = —(—2) = 2, because —2 < 0. 
In other words, the absolute value of a positive number is the same positive number, 
and the absolute value of a negative number z is the corresponding positive number, 
which is —2. 

Theorem 7. For every z, |z| 2 0. 

Proof. (1) It x = 0, then |x| = 0, because in this case =. 

(2) If x < 0, then —z > 0. Therefore |x| > 0, because lz = =z. 

Theorem 8. |—2| = |z| for every z. 

Proof. (1) Ifx 2 0, then —z < 0. Thus 

|z| = a, 

and 


|-z| = —(—2) = z. 


Therefore, in this case, |—z| = |z]. 
(2) If z < 0, then --z > 0. Therefore 


|z| futu, 
and 
|—a| = —z. 


Hence, in this case also, |—2| = |z]. 


Theorem 9. |x| = x for every x. 


Proof. If x = 0, this is true because x 2 c. If x < 0, then x < |z|, because 
|z| = 0. 


Theorem 10. |xy| = |x|- |y] for every x and y. 


Proof. When x is replaced by —z, both sides of the equation are unchanged, 
therefore we can assume that x 2 0. For the same reason, we can assume that 
y = 0. If z = O and y Z 0, the equation takes the form zy — zy. 

When we write a “double inequality” 


a<b<e, 


we mean that both of the inequalities a < b and b < c hold true. 
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Theorem 11. Let a be > 0. Then 


jz| <a 
if and only if 
4 <2 «a. 


Graphically speaking, this theorem says that the numbers that satisfy the in- 
equality |x| < a are the numbers between a and —a like this: 


—a 0 a 
++ 
SS Eis 


|x] < a 


Figure 1.3 


Proof. (1) If x 2 0, then |r| < a means that x < a. Therefore |x| < a is 
true when 0 € x < a. ! 

(2) If x < 0, then |x| < a means that —z < a, or —a < x. Hence |x| < a 
is true when —a < z « 0. 

Therefore |x| <-a holds whenever —a < x < a. It is easy to check, conversely, 
that if |z| < a, then —a < x < a. (There are two cases to be considered, as 
in Conditions (1) and (2) above.) Therefore 


l| <a@—a <2 <a, 
which was to be proved. 


Theorem 12. For every a, b, 


la + b| S la| + [b]. 
Proof. case 1. Suppose that a + b = 0. In this case, 
ja+ b| — a 4- b. 
By Theorem 9, 
a S ja, and bajol. 
Therefore 


a+b < |a| + lèl, 


and since a + b = |a + b| in Case 1, the theorem follows. l 
casE 2. Suppose that a+b <0. Then (—a) + (—b) > 0. By our result 


for Case 1, we have 
\(—a) + (—b)| < |—a| + |—bl. 
But by Theorem 8, we know that 
|-« — b| = ja +b), [al = la, | |—b| = jbl. 


Substituting, we get 


la + | S Ja] + [b], 
which was to be proved. 
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Prosiem Ser 1.4 
1. Solve the following inequalities. The answers should be in one of the forms 
Sa Or” - uen we al} 


(a) 5— 3-2 >17+2 
(b) 5-2 —8 < 17+2+1 
() c+5>6—2 


(à) |z| < 1 
(e) |z — 3| < 2 
(f) |z — 5| < 5 


. Is it true that |z?| = |z|? for every z? Why or why not? 
. Is it true that |z3| = |x|? for every z? Why or why not? 
. Show that z? — 2z + 1 2 0 for every z. 
. For what numbers z (if any) does each of the following conditions hold? 
(a) |z? — 5-z4- 6| = |z — 3|: |z — 2] 
(b) |z? — 5-2+ 6| = 2? —5-2+6 
(c) |z — 5| = [2-2 — 3| 
(d) jz +1] = [i — z| 
(e) Ve? Fl =z 
( vz =l =z 
(g) [2-2 — 1| + lz + 3| 2 [3-2+2| 
(b) |T- z 4- 3| + |3 — z| 2 6|z + 1| 
6. Indicate graphically, on a number scale, the places where the following conditions 
hold. 
(a) |z| < 2 
(b |z — 2| < $ 
() |2-2 —3| <$ 
(d) |z — 1| < 2 and (also) |z — 2| < 1 
(e) | — 2-2] « $ 
( |z — 2 < and z > 2 
7. Show that if b = 0, then 


ao os w to 


HE 
| Tl 
8. Show that if b = 0, then 
ENS 
bi lèl 
9. Show that for every a and b, |a — b| = |a| — |b]. 
10. Show that for every a and b, |a + b| = |a] — |b]. 
11. For what numbers a is the fraction a/|a| defined? What is this fraction equal to, 
for various values of a? 
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1.5 ORDER RELATIONS AND ORDERED FIELDS 


So far, we have described the properties of the real number system relative to 
addition, multiplication, and order. A system which satisfies all the postulates 
that we have stated so far is called an ordered field. We repeat the definition of 
an ordered field in its general form, as follows. 

Given a set F. Let * be a relation defined on F, satisfying the following two 
conditions. 


O-1. Every pair a, b of elements of F satisfies one and only one of the conditions 
a * b, a — b, and b * a. 


O-2. If a * band b * c, then a * c. 
Then * is called an order relation. 


Order relations are usually denoted by the symbol «. But we need a more 
general notation, such as *, because we may want to talk about two different rela- 
tions defined on the same set. 

Suppose now that we have a field 


[F, +; jJ 


Suppose that we also have given an order relation «, defined on F, satisfying the 
following two conditions. 


MO-1. If a > O and b > 0, then ab > 0. 
AO-1. If a < b, then a +c < b -- for every c. Then the structure 


[F, A «] 
is called an ordered field. 


Thus what we have said, so far, about the real number system is that it 
forms an ordered field. 

It should be emphasized that an ordered field is not merely a field which is some- 
how arranged in an order. To know that we have an ordered field, we need to 
know that the order relation « is related to multiplication and addition by Con- 
ditions MO-1 and AO-1. 

In the preceding section, all the theorems were proved on the basis of Conditions 
O-1, O-2, MO-1, and AO-1. Therefore these theorems hold true in any ordered 
field whatever. You are free to use them in solving the following problems. 


PROBLEM Set 1.5 


1. Show that in any ordered field we have 0 < 1. [Hint: Show that each of the condi- 
tions 0 = 1, and 0 > 1isimpossible. Note that in this problem you are not being asked 
merely to show that the real number 0 is less than the real number 1.] 


2. In Problem 4 of Problem Set 1.3 you showed that the real numbers of the form 
a+ b4/2, where a and b are rational, form a field. Is this an ordered field, under the usual 
order relation? Why or why not? 
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3. Consider the field F described in Problem 18 of Problem Set 1.2. HereF = (0,1), 
and addition and multiplication are described by the following tables. 


E 0 ! , 0 1 
0 0 1 0 0 0 
1 1 0 1 0 kb 


Is it possible to define an order relation in such a way as to make this system an ordered 
field? 

4. Show that an order relation can be defined for the set of points (z, y) of a coordinate 
plane. (This is à somewhat difficult problem, although one solution is simple once you 
think of it.) 

5. Show that an order relation can be defined for the complex numbers. 

6. Show that it is not possible to define an order relation for the complex numbers in 
such a way as to get an ordered field. (Hint: Suppose that such an order has been defined. 
Show that each of the conditions t > 0, and i < 0 leads toa contradiction of one of the 
postulates or one of the theorems for ordered fields.] 


1.6 THE POSITIVE INTEGERS AND THE INDUCTION PRINCIPLE 


We get the positive integers by starting with 1, and then adding 1 as often as 
we like, Thus the first few positive integers are 
1, 
2=1+1, 
Bidet 220 EL dy 
4=3+4+1= pyei4+T+), 


and so on. We let N be the set of all positive integers. (Here N stands for natural; 
the positive integers are often referred to as the natural numbers.) 

The above common-sense remarks, about the way we get positive integers by 
adding 1 to other positive integers, suggest the pattern of our exact mathematical 
definition of the set N. The set N is defined by the following three conditions. 

(1) 1 belongs to N. 

(2) N is closed under the operation of adding 1. That is, if n belongs to N, then 
so also does n + 1. 

(3) Of all sets of numbers satisfying (1) and (2), N is the smallest. That is, N 
is the intersection, or common part, of all sets of numbers satisfying (1) and (2). 

From Condition (3) we get the following result immediately. 


Theorem 1. The Induction Principle. Let S be a set of numbers. If (1) S 

contains 1, and (2) S is closed under the operation of adding 1, then (3) S con- 

tains all of the positive integers. 

The reason is simple. Since N is the smallest set that satisfies (1) and (2), it 
follows that every other such set contains N. But this trivial looking theorem is 
of course very important. 
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Let us see how the induction principle, in the form in which we have stated it, 
can be put to work. 


Theorem A. For evcry positive integer n, the sum of the squares of the first 
n positive integers is (n/6)(n + 1)(2n + 1). That is, for every n we huve 


4 o%4...+n7= g (n+ Dn + 1). 
The induction proof is as follows. Let S be the set of all positive integers n for 
which it is true that ^ 
P2 opa =F Dn 1. 
Thus, if we say that 1 belongs to S, this means that 
1? = &0 + (2:1 + 1). 
To say that 2 belongs to S means that 
1? + 2? = 82 + 1)(2-2+4+ 1), 
and so on. 
We shall show that (1) S contains 1, and (2) S is closed under the operation of 
adding 1. 
(1) S contains 1 because the equation 
i 3 1? = 401 + 1)(2-1 +1) 
is a true equation. 
(2) To prove (2), we must show that if a given integer n belongs to S, then so 
also does n + 1. Thus we must show that if 
(a) 1742? 4-0-4 n? =F (nd Dn +t 1), 
then 


O) P42? +--+ n De PET n+ Hen + 3), 


(The first equation tells us that n belongs to S, and the second tells us that n + 1 
belongs to S.) 
Given that (a) holds, it follows that 


() PHH- n* (n 1)? =F (n+ Dn 4-1) + (n 1)? 


TEL (on? + n+ 6n + 6) 


= "E+ (en? + 7m + 6) 


TEL (n+ 2)(2n +3). 


Therefore (b) holds. 
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This is the way a proof based on Theorem 1 always works. Always, you are prov- 
ing that a certain opén sentence gives true statements for every positive integer 
n. Always, you start by letting S be the set of all positive integers in the solution 
set. You then show that your set S satisfies (1) and (2) of Theorem 1. Then you 
conclude from Theorem 1 that your solution set S contains all of the positive 
integers. 

The following alternative form of the induction principle may be more familiar. 


Theorem 2. Let 
P3, P3, +++ 


be a sequence of propositions (one proposition Ps for each positive integer 
n). If 

(1) P, is true, and 

(2) Pa implies P1 for every n, 
then i 

(3) all of the propositions P3, Pa, . . - are true. 


For example, we might consider the case where P, says that 
Pepe tar = sot DO FD. 


Thus the first few propositions in the sequence would be the following: 


p, i$ -d40- )Q-1-4 1), 
| Po: 422 = 224+ 02-24) 
| p 04243 F DU SFD, 
and so on. | 
Theorem 2 is a consequence of Theorem 1. To prove this, we begin the same 
way that we always begin when applying Theorem 1. We let S be the set of all 
positive integers n for which P; is true. Statement (1) now tells us that S contains 
1. Statement (2) tells us that S is closed under the operation of adding 1. By 
Theorem 1, S contains all positive integers. Therefore all the propositions P1, 
Po, . | . are true, which was to be proved. 
For future reference, we note yet à third form of the induction principle. 


Theorem 3. The Well-Ordering Principle. Every nonempty set of positive in- 
tegers has & smallest element. 


Proof. Let K be a nonempty subset of N. If K contains 1, then K has a least 
element, namely 1, and so there is nothing to prove. 

Suppose then that K does not contain 1. Let S be the set of all positive integers 
n for which it is true that K contains none of the integers 1,2; ...,n. (For example, 
if K were the set (10, 20, 30, . - }, S would be the set: (1,2, 3, ..., 9- 
would not belong to S, because 10 < 15, and 10 belongs to Kyo; 
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We know that (1) S contains 1, because K does not contain 1. If it were true 
that (2) S is closed under the operation of adding 1, then S would contain all of 
the positive integers, and K would be empty. Therefore S must not be closed under 
the operation of adding 1. Hence there is an integer n such that n belongs to S 
and n+ 1 does not. This means that K contains none of the numbers 1, 2,..., 
n, but does contain n + 1. It follows that n + 1 belongs to K and is the smallest 
element of K. 

(In the example given above, the smallest element of K is obviously 10. You 
should check to see that 10 is the number that we get when we apply the general 
proof to this particular set K.) 

The choice between Theorems 1 and 2 is merely a matter of taste. Theorem 1 
is generally considered more “modern,” and Theorem 2 more comfortable. But 
there are times when our third theorem is vastly easier to apply than the preced- 
ing two. (See, for example, the chapter on the theory of numbers.) 


1.7 THE INTEGERS AND THE RATIONAL NUMBERS 


If we add to the set N the number 0 and then all of the negatives of the numbers 
in N, we get all of the integers. The set of integers is denoted by Z. Thus 


ZEE OE ROR NY 


If a number z can be expressed in the form of p/q, where p and q are integers 
and q = 0, then z is called a rational number. The set of all rational numbers 
is denoted by Q. (Here Q stands for quotient; the rational numbers are those which 
are quotients of integers.) 

We would now like to prove the well-known fact that the rational numbers form 
a field. Under the scheme that we have been using in this chapter, the proof in- 
volves an unexpected difficulty. Following a procedure which is the reverse of 
the usual one, we have defined the positive integers in terms of the real numbers; 
and at the present stage we do not officially know that sums and products of 
integers are always integers. This can be proved, but we postpone the proof until 
the end of this chapter; in the meantime we regard the closure of the integers as 
a postulate. 


CL. The Closure Postulate. The integers are closed under addition and 
multiplication. 


The following theorem is now easy. 


Theorem 1. The rational numbers form an ordered field. 
Proof. We shall verify the field postulates one at a time. 
A-1. Closure Under Addition. 


p 8 PEIN E uel. qui My atr 4 Lagar 
ers te go Nar as’ Md an Wit Bary pore 


E is rational. 
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A-2. Since multiplication is associative for real numbers in general, it follows 
that multiplication is associative for rational numbers in particular. 


(This is an instance of a general principle. If a postulate says that a certain 
equation is an algebraic identity, then this postulate automatically holds in any 
subsystem of the given system.) 


A-3. Zero is rational, because 0 = 0/1. 
A-4. Given a rational number p/q, we have —(p/q) = (—p)/q, which is rational. 


A-5. Since addition is commutative for all real numbers, it is commutative for 
all rational numbers. 


M-1. p/q-r/s = pr/as, which is rational. 

M-2. See the verifications of A-2 and A-5. 

M-3. 1 is rational, because 1 =1/1. 

M-4. If p/q = 0, then p # 0. Therefore (p/q)~! = q/p, which is rational. 
M-5. See the verifications of A-2, A-5, and M-2. 


AM-1. The distributive law holds for rational numbers, because it holds for all 
real numbers. 


Thus Q forms a field. And the order relation <, given for all real numbers, 
applies in particular to the rational numbers, and the order postulates auto- 
matically hold. 2 

Finally we remark, by way of preparation for some of the problems below, 
that if a number is rational, =p/4, then it can be expressed as a fraction in lowest 
terms. That is, p and q can be chosen in such a way that no positive integer other 
than 1 is a factor of both of them. Thus, for example, if z = p/q, then x can be 
expressed as a fraction r/s, where r and s are not both even, are not both divisible 
by 3, and so on. Here we are really appealing to a theorem in the theory of numbers, 
to be proved in Chapter 29 at the end of the book. 


PnosLEM Ser 1.7 


1. A positive integer n is even if n = 2k, where k is an integer; n is odd if n = 2j + 1, 
where j is an integer. Show that every positive integer is either even or odd. [Hint: 
Let S be the set of all positive integers which are either even or odd. What you need to 
show is that S = N. Verify that S satisfies Conditions (1) and (2) of Theorem 1 of Sec- 
tion 1.6.] 

2. Show that if n is odd, then n? is odd. 

3. Show that if n? is even, then n is even. 

4. Show that if VZ = p/q, then p is even. 

5. Show that if VZ = p/q, then q is also even. 

6. Show that vZ is not = p/q for any integers p and q. 
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7. Show that every positive integer n has one of the three forms 3k, 3j + 1, or 3m + 2. 
8. Show that if n = 3j + 1, then n? has the same form. 

9. Show that if n = 3m + 2, then n? has the form 3k + 1. 

10. Show that if n? is divisible by 3, then so also is n. 

11. Show that 4/3 is irrational. 

12. Now try to use the same pattern of proof to "prove" that 4/4 is irrational. The 


“proof” must break down at some point because the theorem is ridiculous. Where does 
the proof break down? 


13. Show that if a, b, c, and d are rational, and 
a+ by2 = c+ dv2, 
then 
a=c and b=d 


*14. Let n and p be positive integers. Show that n can always be expressed in the form 
ki n= pcr 

where 0 € r < p. (Two of the preceding exercises assert that this is true for p = 2 
and p — 3) 


1.8 THE ARCHIMEDEAN POSTULATE; EUCLIDEAN COMPLETENESS 


It may appear that the postulates for an ordered field are an adequate descrip- 
tion of the real number system. But this is far from being the case; our postulates, 
so far, allow some strange possibilities indeed. We shall not discuss these, but 
merely state further postulates, enough to rule them out. 

Throughout this section, F is an ordered field. 

The easiest way to see the meaning of the following postulate is to think of it 
geometrically. Suppose we have given two linear segments like this: 


M 


Figure 1.4 


The case of interest is the one in which the first segment is “very long” and the 
second is “very short.” It is reasonable to suppose that if you take enough copies 
of the second segment, and lay them end to end, you ought to get a segment 
longer than the first one. And this should be true no matter how long the first 
may be, and no matter how short the second may be. If the length of the segments 
are the real numbers M and e, as indicated in the figure, and n copies of the second 
segment are enough, then we have 


ne> M. 


The algebraic form of this statement follows. 


ah od 


—_— Ve 
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A. The Archimedean Postulate. Let M and e be any two positive numbers. 
Then there is a positive integer n such that 


ne> M. 


An ordered field which satisfies this condition is called Archimedean. Henceforth 
we shall assume that the real number system forms an Archimedean ordered 
field. 

Note that if a certain integer n gives us ne > M, then any larger integer has the 
same property. Therefore the postulate might equally well have gone on to say 
that we have ne > M for every integer n greater than or equal to a certain no. 

Even our latest postulate, however, is still not enough for the purposes either of 
algebra or of geometry. The easiest way to see this is to observe that the field 
Q of rational numbers satisfies all our postulates so far, and in Q the number 2 
has no square root. We need to know that our number field is complete in such 
a sense as to permit the ordinary processes of algebra. For a long time to come, 
it will be sufficient for us to know that every positive number has a square root. 

If a > 0, then z is a square root of a if x? = a. Obviously, if x is a square root 
of a, then so also is —z. Therefore, if a number has one square root, it must have 
two. On the other hand, no number a has two different positive square roots 
zı, and z3; if this were so, we would have 

fa a= 2h 
zi LIE z3 =0, 
(zi — z2)(zi + 22) = 0. 


Here zı — z2 # 0, because Tı # £2, and zı + za > 0, because zı > 0. and 
xg > 0. Therefore the product (zı — z2)(zi + x2) cannot be = 0. 
An ordered field is called Euclidean if it satisfies the following condition. 


C-1. The Euclidean Completeness Postulate. Every positive number has a 
positive square root. 


We call this the Euclidean postulate because of the part that it will play in 
geometry. Eventually, this postulate will ensure that circles will intersect lines, 
and intersect each other, in the ways that we would expect. 

It follows, of course, that every a > 0 has exactly one positive square root. 
This is denoted by v/a. The other square root of a is —/a. We agree that 
V0 = 0. 


This terminology is a little confusing. Consider the following statements: 

(1) x is a square root of a. 

(2) z = Va. 

The second of these statements is not merely a shorthand transcription of the 
first. Statement (1) means merely that x? = a. Statement (2) means not only 


that x? = a but also that z = 0. Strictly speaking, it is never correct to speak 
of “the square root of a,” except when a = 0, because every a ~ 0 has either two 
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square roots or none at all (in the real number system). One way to avoid this con- 
fusion is to pronounce the symbol v/a as “root a," thus warning people that you 
are pronouncing a formula. 

Much later, we shall need another completeness postulate, to guarantee, for 
example, the existence of m. We shall postpone this discussion until such time 


as we need it. 
The following trivial looking observations turn out to be surprisingly useful. 


Theorem 1. For every real number a there is an integer n > a and an integer 
m « a. 


Proof. In getting n, we can assume a > 0. In the Archimedean postulate, 
take M = a, e = 1. This gives an n such that n-1 > a, as desired. To get 
m < a, we merely take n > —a, and let m = —n. 


Theorem 2. Between any two real numbers, there is at least one rational 
number. 


(Obviously there are more.) 


Proof. Given z < y. If there is a rational number r, with z +n <r < y +n, E 
then there is a rational number r’ = r — n between x and y. We may therefore i 
suppose that F 

lez dL 1 
€ 
S | 
———————Ó—3À- 
0 1 x y 
Fieure 1.5 
Let 
e-— y -— a. 
By the Archimedean postulate, we have 
pe »1 f 
for some integer p, Thus . 
2 « e. 
p 


The rational numbers with denominator p now divide the whole number line | 
into segments of length 1/p, like this: 
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Ficure 1.6 
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If k/p is the first one of them that lies to the right of z, then k/p ought to be be- 
tween x and y, because 


1 
> LEEY t 
p y 
To be more precise, let 
n 
K= fra]? j- 


By the well-ordering principle, K has a least element k. Thus 
k 


MS 
p 
but 
k—1 
g. 
p = 
Therefore 
TIL IIR 
s2z+(y—2) 
= y: 
Therefore 
k 
« —* y 
x $ V 


which was to be proved. 

In using the Archimedean postulate to prove this trivial looking theorem, we 
are not making any sort of joke; the postulate is needed. There are ordered fields 
in which the postulate fails. (See Chapter 28.) In such fields, Theorems 1 and 2 
do not hold true either; in them, some numbers z and y are greater than every in- 
teger and hence greater than every rational number. For many of the purposes 
of geometry we need the Archimedean postulate to rule out such phenomena. 


PnonLEM Ser 1.8 


1. Show that if 0 < z < y, then z? < y?. Does this conclusion follow if we know 
only that z < y? Why or why not? 
2. Show that if z, y > 0, and z? < y?, then z < y. 
. Show that if 0 < a < b, then Va < Vb. 
. Show that there is such a number as AA + V2. 


3 
4 
5. Same as Problem 4 for 4/2 — V2. 
6 


. Same as Problem 4 for /(3 — V2)/(7 — /18). 


7. Show that 1/+/2 cannot be expressed in the form a + b4/2, where a and b are ra- 
tional. [Hint: You need a theorem from the preceding set of problems.] 
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1.9 THE LANGUAGE AND NOTATION OF SETS 


So far, we have been using the language of sets rather sparingly and with a mini- 
mum of special notation. There is a standard shorthand, however, which is worth 
learning partly because it is widely used, and partly because it enables us to be 
both brief and exact in notebooks and on blackboards. 

Throughout this section, capital letters denote sets. If a is an element of A, 
then we write wea 
The symbol € is usually pronounced “belongs to." When we write a € A, this 
means that a does not belong to A. If every element of A is also an element of 
B, then A is called a subset of B, and we write 


ACB, 
BOA. 


Note that here we are allowing the possibility that A = B; that is, every set is 
a subset of itself. 

The intersection of A and B is the set of all objects that are elements of A and 
also elements of B. The intersection is denoted by A N B. (This is pronounced 
“A cap B," because the symbol N looks vaguely like a cap. Thus 


AnB-(ízxzeA and EB}. 


A word of caution is in order about the use of the word intersection. When we 
speak of the intersection of A and B, and write A N B, this allows the possibility 
that A N B is the empty set Ø. But when we say that two sets A and B intersect, 
we always mean that A and B have at least one element in common. This distine- 
tion in usage, between the noun and the verb, is not very logical, but it turns out 
to be convenient; besides, it is nearly universal, and there is not much to be done 
about it. 

The union of A and B is the set of all objects that are elements either of A or of 
B, or of both. The union is denoted by A U B. (This is pronounced “A cup B,” 
because the symbol looks vaguely like a cup.) Thus 


AUB= {zjrEA or cE B}. 


(Here, and everywhere else in mathematics, when we say “either... or... ,” we 
allow the possibility that both of the stated conditions hold. If we really mean 
*" |, but not both,” we have to say so.) 

The difference between two sets A and B is the set of all objects that belong to 
A but not to B. The difference is denoted by A — B. (This is pronounced “A 
minus B.”) Thus 

A—B= {zltEA and « ¢ B). 


Some books have been written making very free use of this symbolism, but this 
book is not one of them. Most of the time, we shall use words. We shall, of course, 
make constant use of the concepts represented by the symbols €, €, C, D, N, U, 
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and —. The following problem set is designed merely to give you a little practice 
in writing and interpreting the symbols. "These problems should be worked out on 
the basis of your “common sense" knowledge of how sets behave; in this book, 
we shall make no attempt to treat sets formally by means of postulates. 

Finally, we mention two common and useful blackboard abbreviations: 

(1) 3 means "there exists." 

(2) 3 means "such that." 
For example, the Euclidean completeness postulate C-1 might be stated as follows: 

If a € Randa > 0, then3z2z > 0 and z? — a. 


The symbol $ means "there does not exist." 


PnosLEM Ser 1.9 
Which of the following statements hold true for all sets A, B, C, .. - ? 


lACAUB 

2.A»25AnB 

3 ACANB 

4. An (B— A) =Í 

5. If AC B, then z E A »zeB 

6. If A C B and B C C, then AC C 

E Either A C Bor3a23a€ A anda € B 
8& A—BCA 

9. If A = ANB, then ACB 

10. If AC B, then A = ANB 

11. If A = AUB, then BC A 

12. (4 — B) n (4 — O) = A— (BUC) 
13. (A — B)N (AU B) -AnB 


1.10 n-FOLD SUMS AND PRODUCTS; THE GENERALIZED ASSOCIATIVE LAW 

There is a certain trouble with the associative laws. of addition and multiplica- 
tion. As they stand, they are not adequate to justify the things that we talk 
about, and the things we do, when we do algebra. At the end of Section 1.2, we 
remarked that it was quite all right to write triple produets abc, because (ab)c 
is always the same number às a(bc) ; and similarly for addition. In practice, how- 
ever, as soon as you get past Chapter 1 of anybody's book, you are writing n-fold 
sums 


a; tH az +e Ham 


and n-fold products 
0402 * * * n 
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for n > 3. We insert and delete parentheses in these sums and products, at will. 
All this is fine, but it has not been connected up, so farz with the operations that 
are supposed to be given for pairs of numbers (a, b) and with the associative laws 
for triplets (a, b, c). It would be a pity if mathematics appeared to be split down 
the middle, with the postulates and the careful definitions on one side, and the real 
mathematical content on the other. Let us therefore bridge the gap between our 
postulates and the things that we intend to do. 

The key to our problem is the induction idea. What is given in a field is the 
twofold product ab, for every a and b in the field. Given a;, a», az, we define 
the threefold product by the formula 


4,55 = (a,2)05. 
Similarly, we define 
4,050304 = (a,0503)a4, 


where the parenthesis on the right is defined by the preceding equation. In general, 


08102... Annyi = (aiaa eae @n)An41- 


That is, to form an (n + 1)-fold product, we first form the n-fold product (of the 
first n factors) and then multiply the result by the last factor. 

This is our official definition of the n-fold product. But there is another scheme 
that we might have used. We might have defined the triple product as 


4,0203 = a4 (a2a3). 
We could then have said, in general, that 
0102... Anny = a1(a5a3 sao Anani); 


that is, to form an (n + 1)-fold product, we could first form the product of the last 
n factors and then multiply the result by the first factor. In fact, the first thing that 
we need to prove is that the choice between these two schemes makes no difference. 
We get the same result in any case. In the following theorem, we change our nota- 
tion slightly, to avoid getting very peculiar looking statements forn = 1 andn = 2. 


Theorem 1. l'or every positive integer n, 
aba,a5...a, = a(baya2... an). 


Proof. Let S be the set of all positive integers for which this formula holds. 
We shall show, by induction, that S contains all of the positive integers. Thus we 
need to show two things: 


(1) S contains 1. 
(2) S is closed under the operation of adding 1. 


Proof of (1). S contains 1 if 
aba, = a(ba;). 
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Now aba, = (ab)a;, by definition. And (ab)a; = a(ba;), by the ordinary associa- 
tive law for triplets. Therefore S contains 1. 
Proof of (2). Here we need to show that if S contains n, then S also contains 


n+ 1. This means that if 
(i) abayag...@_, = a(baja2... an), 
then 
(ii) abayag...Qn@n41 = a(baya2... Os 41)- 
This is shown as follows. We have 
aba;a2 ... sn. 1 = (aba,a5 . . . à5)0n4-1; 
by definition. By (i), the expression on the right is 
= [a(baias . . . dn)|@n41- 
By the associative law, this is 
= a[(baia2 . . . Gy) Qn +1). 
By definition of the (n + 2)-fold product, the expression on the right is 
= a(ba,a5 . . . @nn41): 


Therefore (ii) holds. With the aid of this theorem, we shall prove the following 
theorem. 


Theorem 2. The General Associative Law. In any n-fold product, the insertion 
of one pair of parentheses leaves the value of the produet unchanged. 


Proof. Let S be the set of all positive integers n for which it is true that paren- 
theses can be inserted in any n-fold product without changing its value. To 
prove our theorem, we need to show that 


(1) S contains 1, and 
(2) S is closed under the operation of adding 1. 


Proof of (1). Obviously a, = (a;) for every a. Therefore S contains 1. 
Proof of (2). Given an (n + 1)-fold product 


4102 . . . OnGn41- 
Suppose that we insert a pair of parentheses. There are three cases to be considered. 
(i) The opening parenthesis comes somewhere after a1, as 
0103 ... Qi(Qi41 - » Gk) - - < Ongi- 


(Here we allow the possibility that k = n + 1.) 
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(ii) The closing parenthesis comes somewhere before an+- 
(iii) The parentheses enclose the entire product. 


Obviously, in Case (iii) there is nothing to prove. In Case (i), 
4103... Gi(Qiy1. Gk) ^. angi = Ailae... ai(aizi--- Gx) . « « Ona); 
by Theorem 1. This is 
= aj[a2... Qn 4.1]; 
because S contains n, and in turn, this becomes 


= 0402...0p, 


by Theorem 1. Thus, if S contains n, it follows that S contains n + 1. 
In Case (ii), we have 


4102... G;(Qi41... 4%)... nt, 


where k < n + 1, but i may be = 1. By definition of an (n + 1)-fold product, 
this becomes 
= [ayaa .. (0141... Gp)... OnlOn yr; 


which in turn becomes 
= [aja2... G5]an 41, 
because S contains n; this is 
= 04102... 0540541, 
' by definition of an (n + 1)-fold product. This completes our induction proof. 
The things that we ordinarily want to do with n-fold products can be justified 
by repeated applications of this theorem. For example, 
H 
b 


Proof. The left-hand number is given by 


OET 


by two applications of Theorem 2. This is 


wb luigi. 
[4 


— alld 
= (al)(1d) 
= ad. 


We define n-fold sums in exactly the same way and conclude by the same proof 
that n-fold sums satisfy the general associative law. That is, insertion of one pair 
of parentheses in an n-fold sum, 


a, + a2 d seb an 
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leaves the value of the sum unchanged. Finally we observe that we always have 
a(by + ba +++ + bn) = abı + aba + +++ + abn- 

The proof is by an easy induction. For n = 1, we have ab, = abı. Given that 
albi + be +++: + bn) = ab; + aba + +--+ abn, 

it follows that 


abı Eba +++ + bn + bnti) = albi + ba + +++ + bn) + bagal 
= a(b; + ba + +++ + bn) + abny1 

(ab, + aba + +++ + abn) + abs 

ab, + aba +--+ + abn + abs. 


1.11 THE CLOSURE OF THE INTEGERS UNDER ADDITION AND MULTIPLICATION 


We found, in Section 1.7, that to prove that the set Q of rational numbers 
forms a field, we needed to know that sums and products of integers are always 
integers. Under our definition of integers, this requires proof; the proof is merely 


a series of exercises in the use of induction. 
We recall that the set N of positive integers was defined by the following three 


conditions: 
(1) N contains 1, 
(2) N is closed under the operation of adding 1, and 
(3) of all sets of numbers satisfying (1) and (2), N is the smallest. 


To get the set Z of integers, we added to N the number 0, and also the negatives 
of all of the positive integers. 


Theorem 1. If a and n are positive integers, then so also is a + n. 


Proof. Let a be fixed. Let S be the set of all positive integers n for which 
a--n € N. Then (1) 1 € S, because N is closed under the operation of adding 1, 
and (2) if n € S, then n + 1 € S. For if a + n EN, we have 


a+ (n -- 1) = (a c n) +1, 
which belongs to N. 


Theorem 2. If a and n are positive integers, then so also is an. 
Proof. Let a be fixed, and let 
S = {nlan € Nj. 
Then (1) 1 € S, because al = a. (2) If n € S, then n + 1€ S. For 
a(n + 1) = an + al = an t a, 


which is the sum of two positive integers. 
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Theorem 3. If z and. y are integers, so also is zy. 
Proof. If xz, y > 0, this follows from Theorem 2. If z > 0 and y < 0, then 


Ty — —[z(—v)]; 


which is the negative of the positive integer z(—y). The case x < 0, y > 0 is the 
same. If z, y < 0, then zy is the positive integer (—x)(—y). Finally, if z = 0 
or y = 0, we have zy = 0, which is an integer. This takes care of closure under 
multiplication. Rather oddly, addition is a little more troublesome. 


Theorem 4. Every positive integer is either —1 or =k + 1 for some positive 
integer k. 


Proof. Let S be the set of all positive integers satisfying the conditions of the 
theorem. Then obviously 1 € S. If n € S, and n = 1, then n+ 1 € S, with 
k = 1. If n € S, and n = 1, then n = k + 1 for some positive integer k. There- 
fore n + 1 = (k + 1) + 1, which has the desired form. 


Theorem 5. If n € Z, then n — 1 € Z. 


Proof. (1) If n > 0, then n = k + 1 for some k in Z. Therefore n — 1 = 
k ez. 


(2 Ifn = 0, then n — 1 = —1 EZ. 
(3) If n < 0, then n = —k, where k > 0. Therefore n — 1 = (—k) + 
(—1) = —(k + 1), which is the negative of a positive integer. 


Theorem 6. Ifa €Z and » EN, then a — n € Z. 


Proof. Let S = {nla — n € Z}. Then (1) 1 € S, by Theorem 5, and (2) if 
n E S, then n + 1 E€ S. For if a — n E Z, then 


a — (n + 1) = (a — n) — 1, 
which belongs to Z by Theorem 5. 
Finally, we have the following theorem. 
Theorem 7. If x, y E Z, then z + y € Z. ' 


Proof. case 1. If either z = 0 or y = 0, this holds trivially. 
CASE 2. If z, y > 0, then x + y EN, and so belongs to Z. 
CASE 3. If z, y < 0, then 


z-cy-— —[(—2) + E 


which is the negative of the positive integer (—x) + (—y). 


case 4. If z <0 < y, letn = —z. Then n > 0, andz--y — y — n. We 
know by Theorem 6 that y — » € Z. 
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All this, it must be confessed, is rather tedious; but the tedium is unavoidable. 
We need to know about the integers, the real numbers, and the relation between 
them. One way to do this is first to set up the integers, and then to build the real 
numbers from them. (For such a treatment see, for example, Landau’s Founda- 
tions of Analysis.) In this chapter we have first set up the real numbers and then 
“moved from the top downward” to get the integers. The latter scheme is by far 
the simpler and easier. But we cannot expect to get something for absolutely 
nothing. 


CHAPTER 7 


INCIDENCE GEOMETRY IN PLANES 
AND SPACE 


You will recall that when we started to discuss the real numbers from a postula- 
tional point of view, we began with three things: a set R (whose elements were 
called numbers) and two laws of combination (called addition and multiplication, 
and denoted by + and -). Thus, in Section 1.2, the structure that we were working 
with was a triplet [R, +, -], where R was a set and + and - were operations defined 
in R. A little later, we assumed that we had an order relation <, defined in R 
and subject to certain conditions. Thus, at the end of Chapter 1, the structure 
that we were working with was a quadruplet [R, +, , <]; and all of our postulates 
were stated in terms of these four objects. 

We shall follow the same scheme in our postulational treatment of the geometry 
of planes and space. In the scheme that we shall be using, space will be regarded 
as a set S; the points of space will be the elements of this set. We will also have 
given a collection of subsets of S, called lines, and another collection of subsets of 
S, called planes. Thus the structure that we start with is a triplet 


(S, £, 6], 


where the elements of S, £, and @ are called points, lines and planes, respectively. 
Later, we shall add to this structure, just as we added to our algebraic structure 
in the latter part of Chapter 1. For the present, however, our postulates are going 
to be stated in terms of the sets S, £, and @. 

The above presentation is equivalent to one in which we say that the terms 
point, line, and plane are taken as undefined. 

In our formal mathematics, we are going to use postulates; and the only things 
that we shall claim to know about points, lines, and planes will be the things 
stated in the postulates. Informally, however, it may be a good idea to remind 
ourselves of the sort of things that lines and planes will turn out to be. A line 
is going to stretch out infinitely far in both directions like this: 


<< 


FIGURE 2.1 
36 
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Here the arrowheads are supposed to indicate that the line doesn’t stop where 
the picture of it stops. We shall have another term, segment, for a figure which 
looks like this: 


Pe—— -— +0 
Figure 2.2 


If the end points are P and Q, then this figure will be called the segment from 
P to Q. Fora "liae" to stretch out infinitely far in only one direction is not enough: 


P Q 
Figure 2.3 


A figure like this will be called a ray. Similarly, a plane stretches out infinitely 
far in every direction. Thus the floor of your room would not form a plane, even 
if it were perfectly flat. It would form a part of a plane, and a rather small part, 
at that. 

Logically speaking, we are getting ahead of ourselves when we draw these 
pictures. The postulates of this section are nowhere nearly enough to guarantee 
that lines look like our pictures, as you will see in the next set of problems. 

Our first postulate is merely a reminder. _ 


1-0. All lines and planes are sets of points. 

If a line L is a subset of a plane E, then we shall say that L lies in E. (The same 
term is used in general, to mean that one set is a subset of another.) If a point 
P belongs to a line L, then we may say that P lies on L or that L passes through P. 
Similarly, if P belongs to a plane E, then we may say that P lies in E or that E 
passes through P. (Here, of course, we are merely defining the familiar colloquial 
language of geometry in terms of the set-theoretic apparatus that is used in our 
postulates.) By a figure we mean a set of points. 

Points lying on one line are called collinear, and points lying in one plane are 
coplanar. . 


l-1. Given any two different points, there is exactly one line containing them. 


If the points are P and Q, then the line containing them is denoted by TQ. 
The arrowheads are meant to remind us of the usual representation of lines in 
figures. 

1-2. Given any three different noncollinear points, there is exactly one plane 

containing them. 

If the three points are P, Q, and R, then the plane containing them is denoted 
by PQR. 


1-3. If two points lie in a plane, then the line containing them lies in the plane. 


1-4. If two planes intersect, then their intersection is a line. 
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If you check back carefully, you will see that postulates I-0 through I-4 are 
satisfied by the poverty-stricken “geometry” in which there is exactly one point 
P in S, and this point P is both a line and a plane. To rule out, at least, such 
extreme trivialities, we state another postulate immediately. 


1-5. Every line contains at least two points. Every plane contains at least three 
noncollinear points. And S contains at least four noncoplanar points. 


(Throughout this book, if we say that “P and Q are points,” we allow the pos- 
sibility that P = Q. But if we speak of “two points,” we mean that there are 
really two of them; that is, the points must be different; similarly for planes, and 
so forth. Sometimes we may speak of “two different points,” as in I-1, but this is 
merely for emphasis.) 


Theorem 1. Two different lines intersect in at most one point. 


Proof. Let Lı and Lz be two lines, and suppose that their intersection contains 
two points P and Q. This is impossible by Postulate I-1, because I-1 says that 
there is exactly one line, and hence only one line, containing P and Q. 


Theorem 2. If a line intersects a plane not containing it, then the intersection 
is a single point. L 


Figure 2.4 


Proof. Let L be a line intersecting a plane E. We have given that L N E con- 
tains at least one point P; and we need to prove that L N E contains no other 
point Q. 

Suppose that there is a second point Qin L N E. Then L = PQ, by Theorem 1. 
By 1-3, PO lies in E. Therefore L lies in E, which contradicts the hypothesis for L. 


Theorem 3. Given a line and a point not on the line, there is exactly one plane 
containing both of them. 


RESTATEMENT. Let L be a line, and let P be a point not on L. Then there is 
one and only one plane containing L U P. 


(Here we introduce a device which will be convenient later. Whenever we can, 
we shall state theorems in ordinary English, with little or no notation. This way, 
the theorems are easier to read and to remember. The restatement furnishes us 
with the notation that will be used in the proof, and in some cases it may remove 
some vagueness or ambiguity.) 


/—————— 
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Proof. (1) By 1-5, L contains at least two points Q and R. 

(2) P, Q, and R are not collinear. The reason is that by I-1, L is the only line that 
contains Q and R; and L does not contain P. Therefore no line contains P, Q, and R. 

(3) By (2) and 1-2, there is a plane E — PQR, containing P, Q, and R. By 
I-3, E also contains L. 

Thus there is at least one plane containing L U P. If there were two such 
planes, then both of them would contain P, Q, and R. This is impossible, by I-2, 
because P, Q, and R are noncollinear. 


Theorem 4. If two lines intersect, then their union lies in exactly one plane. 


Let L and L' be two intersecting lines. The following statements are the main 
steps in the proof; you should be able to supply the reasons for each of these 
statements. 


(1) L NL’ is a point P. (2) L' contains a point Q ¥ P. 
(3) There is a plane E, containing L and Q. , 
(4) E contains L U L’. (5) No other plane contains L U L’. 


Theorems of the kind that we have just been proving are called incidence theorems; 
such a theorem deals with the question whether two sets intersect (and if so, how?) 
or the question whether one set lies in another. Incidence theorems are used con- 
stantly, but the incidence postulates on which they are based do not go very far in 
describing space geometry, as Problem 1 below. will indicate. 


PROBLEM Ser 2.1 


1. Consider the system [S, £, ®], where S contains exactly four points A, B, C, and D, 
the lines are the sets with exactly two points, and the planes are the sets with exactly 
three points. This “space” is illustrated by the following figure: 


A 


c 


Figure 2.5 


Here it should be remembered that A, B, C, and D are the only points that count. 
Verify that all the incidence postulates hold in this system. 

2. Let Pı, P2,..., Ps be five points, no three of which are collinear. How many 
lines contain two of these five points? 

3. If no four of the five points are coplanar, how many planes contain three of the 
five points? 

4. Given Pı, P2,..., Pn, all different such that no three of them are collinear and 
no four of them are coplanar. How many lines contain two of them? How many planes 
contain three of them? 


CHAPTER 3 


DISTANCE AND CONGRUENCE 
ERES EUM mace vii ee 


3.1 THE IDEA OF A FUNCTION 


The word function is most commonly used in connection with calculus and its 
various elaborations, but the idea occurs, often without the word, in nearly all 
mathematies. In fact, the first two chapters of this book have been full of functions, 
as we shall now see. 

(1) In a field F, to every element a there corresponds a unique negative, —a. 


Here we have a function 
FF 


under which 
a — —a 
for every a. 

(2) In an ordered field F, to every element x there corresponds a unique number 
Iz], called the absolute value of x. The rule of correspondence is that if x EU, 
then the number corresponding to x is x itself, and if x < 0, then the number 
corresponding to x is —x. Thus we have a function, 


' F — F, 
under which 
x— |z| 

for every x. 

(3) Suppose that F is a Euclidean ordered field. Let F* be the set of all elements 
of F that are = 0. To each element a of F* there corresponds a unique element 
Va of F*. (Recall the Euclidean completeness postulate and the definition of 
Va.) Here we have a function, 

Ft — Ft 


for every a in F+, ORB fe 

(4) The operation of addition in a field F can be considered as a function once 
we have the idea of the product of two sets. For any pair of sets A, B, the product 
A X B is the set of all ordered pairs (a, b), where a € A and b € B. We allow 
the possibility that A = B. 'Thus, when we identify a point. P of a coordinate 
plane by giving a pair of coordinates (x, y), we are associating with P an element 
of the product R X R of the real numbers with themselves, 
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under which 
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Consider now the operation of addition in a field F. Under this operation, to 
every pair (a, b) of numbers in F there corresponds a number a 4- b, called their 
sum. This can be regarded as a function, 


FXF-—F, 
where 
(a, b) — a+b 
for every (a, b) in F X F. 

Obviously multiplication can be regarded in the same way. 

Note that in these situations there are always three objects involved: first, a 
set A of objects to which things are going to correspond; second, a set B which 
contains the objects that correspond to elements of A ; and third, the correspondence 
itself, which associates with every element of A a unique element of B. The set 
A is called the domain of definition, or simply the domain. The set B is called the 
range. The correspondence itself is called the function. In the examples that we 
have been discussing, these are as follows. 


TABLE 3.1 


Domain Range Law 


a — —a 
a — |a] 
a — va 
(a,b) > a+b 


In the third column, we have described the function by giving the law of corre- 
spondence. 

A less simple example would be the function which assigns to every positive 
real number its common logarithm. Here the domain A is the set of all positive 
real numbers, the range B is the set of all real numbers, and the law of correspond- 
ence is x — logyo. Here the expression logio z is an example of functional 
notation. If the function itself is denoted by f, then f(x) denotes the object cor- 
responding to x. For example, if f is the absolute value function, then 


and so on. Similarly, if g is the “positive square root function,” then 


and so on. We can also use functional notation for addition, if we want to (which 
we usually don't). If s is the “sum function,” then 


s(a, b) = a 4- b, 


80 that 
8(2,3) — 5,  .8(5,4) = 9, 
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and so on. Similarly, if p is the “product function,” then 


p(a, b) = ab, 
so that 
p(5, 4) = 20 and p(7, 5) = 35. 


To sum up, a function f is defined if we describe three things: (1) a set_A, called 
the domain, (2) a set B called the range, and (3) a law of correspondence under 
which to every element a of A there corresponds a unique element b of B. . If 
a € A, then f(a) denotes the corresponding element of B. We indicate the func- 
tion f, the domain A, and the range B by writing 


f:A — B, 


and we say that f is a function of A into B. 
We define composition of functions in the way which is familiar from calculus. 
Thus, given 


J[:A—B 
and 

g:B—C, 
we can write 

c = g(f(a)); 


this means that c corresponds to f(a) under the function g. For example, if we are 
using the functions s and p to describe sums and products, then 


s(p(a, b), p(a, c)) 
means ab 4- ac, and 


p(a, s(b, c)) 
means 


a(b + c). 


Finally, we define two special types of function which have special importance. 
If every b in B is = f(a) for at least one a in A, then we say that f is a function 
of A onto B. If every b in B is = f(a) for exactly one a in A, then we say that f 
is a one-to-one correspondence between A and B, and we write 


f:A e B. 


For example, the function F:R — R, z — x? is a one-to-one correspondence. 
The function g:R + R, x > z? is not a one-to-one correspondence because, in 
the range, every positive number appears twice, and no negative number appears 
at all. The function under which z + —z is 4 one-to-one correspondence. (Proof? 
You need to check that each number yis = —z for exactly one number x.) Simi- 
larly, the function z — 1/z is a one-to-one correspondence; here 


A — B — (z|x = 0}. 
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If f is a one-to-one correspondence, then there is a function, 
fU:BeA, 


called the inverse of f, which reverses the action of f. That is, f^! (b) = aif f(a) = 
b. The symbol f^! is pronounced “f-inverse.” When we say that a function has 
an inverse, this is merely another way of saying that the function is a one-to-one 
correspondence. 


Given a function 
f:A >B. 


The image of A is the set of all elements of B that appear as values of the function. 


Thus the image d 
m (ble € À and b= f(a))- 


In other words, the image is the smallest set that could be taken as the range of 
the function. For example, if the function 


f; RR 


is defined by the condition f(z) = x? for every z, then the range is the set R of 
all rea] numbers, and the image is the set of all nonnegative real numbers. 

The question may arise why we define functions in such a way as to permit 
the range to be a bigger set than the image. We might have stated the definition 
in such a way that every function would be onto. But such a definition would be 
quite unmanageable. For example, suppose that we define a function, in calculus, 
by the equation 

f(x) = zt — 7x3 + 8x? — 17x + 3. 


This is a function of R into R. To find out what the image is, we would have to 
find out where this function assumes its minimum; this is a problem in calculus, 
leading to a rather difficult problem in algebra. If we required that the image be 
known for the function to be properly defined, we could not state our calculus 
prc'A»m without first solving it; and this would be a very awkward proceeding. 


PnonLEM Ser 3.1 


1. Using the functional notations s(a, b) and p(a, b) for sums and products, rewrite the 
associative, commutative, and distributive laws for an ordered field. Now rewrite the 
Postulates MO-1 and AO-1 which related the field structure to the order relation in an 
ordered field. 

2. We recall that Z is the set of all integers. For each i, j in Z, let f(i, j) be the larger 
of the two integers i andj. Do these remarks define a function? If so, what are the domain 
and the image? 

3. Let f:R — R be defined by the condition f(z) = z?. Does f have an inverse? 
Why or why not? 

4. Let R* be the set of all nonnegative real numbers. Let g: R* — R* be defined by 
the condition g(z) = z?. Does g have an inverse? Why or why not? 
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5. The same question is asked for f: R — R, f(x) = sin z. 
6. Let A be the closed interval [—7/2, 7/2]. That is, 
ae 


A= fizer and TES 


IA 
t2|3 


B = [-1,1) = @JzER and —1 £ 


IA 
E 
IA 


Let 
g4 >B 


be the function defined by the condition 
g(x) = sin z. 


Does g have an inverse? Why or why not? 


3.2 THE SET-THEORETIC INTERPRETATION OF FUNCTIONS AND RELATIONS 


In the preceding section, we have explained, with numerous examples, what 
people are talking about when they talk about functions. And in fact the idea of 
a function, in the form in which we have explained it, is adequate for nearly all 
the uses that you will need to make of it for a long time to come. 

You will find, however, if you re-read the last section carefully, that at no point 
have we given a straightforward definition of a function; we have explained the 
conditions under which a function is defined, but we have not said what kind of 
object a function is. This we shall now do. But first we shall give some prelimi- 
nary discussion, to indicate the idea behind the definition. 

Consider first the case where the domain is a finite set, say, 


A = {0,1, 2,3, 4, 5}. 


For any function f with A as domain, we can write down a complete table, giving 
the values of the function f. 


a | f(a) 
0 0 
1 1 
2 4 
3 3 
4 4 
5 1 


There was a system used in making up the table; and you may be able to figure 
out what this system was. But even if you can't figure it out, the function is 
perfectly well defined by the table. To define a function, you have to explain what 
its value is, for each element of the domain A, but you don't necessarily have to 
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give this explanation tersely. In fact, if you remember the kind of functions that 
were important in calculus, you will recall that it took quite a while to explain what 
they were. The expression sin x was not a formula for the sine function, but merely 
a name for the sine function; and the explanation of what sin x really meant was 
given in words, at considerable length. For a function like the sine, you could 
write down only a partial table of values, because the domain A was the set of 
all real numbers, which is infinite. Under the definition, however, a correspondence 
was defined under which to every x in R there corresponded a unique real number 
y which was sin x. 

From a finite table, such as the one that we have written above, it is easy to 
read off a set of ordered pairs which describe the function. Each line of the table 
gives us an ordered pair (a, b), in which a ‘s in A and b is the corresponding ele- 
ment of B. From our table for f, we get the pairs 


(0,0), (1,1), (2,4), (3,3), (4,4), (5 1). 


If the set A is infinite, then so is the table. 
A partial table for the sine might look like this. 


x sin x 

0 0 

T 0 
1/2 1 
7/3 4/3/2 
1/4 A/2/2 
=r 0 
—T/2 —1 


From this table we could read off a partial collection of ordered pairs 


(0,0, (m0) (-/2,), (7/3, V3/2),) 
(7/4, 2/2), (—7,0) (—7/2, —1) 


If we formed the set of all ordered pairs of the type (x, sin x), then this infinite 
collection would describe the sine function completely. Similarly every function 
can be described by a collection of ordered pairs. If the domain of the function is 
finite, then so is the collection, and if A is infinite, then so is the collection. If 
the collection describes a function, then every a in A must appear as the first 
term of exactly one pair in the collection, because the function assigns a unique 
value to a. 

Thus, given a function f: A — B, we have a collection of ordered pairs (a, b), 
where (1) a € A, (2) b € B, and (3) every element of A appears exactly once 
as the first term of an ordered pair in the collection. And conversely, given a 
collection of ordered pairs (a, b), satisfying (1), (2), and (3), we always have a 
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function f: A — B. These observations are the basis of the following definition. 
In this definition, we merely are saying that the function zs the sort of collection 
of ordered pairs that we have been discussing. 


Derinition. Let A and B be sets. A function with domain A and range B isa 
collection f of ordered pairs (a, b), such that 


(1) for each (a, b) inf, a © A; 
(2) each a in A is the first term of exactly one pair (a, b) in f; and 
(3) for each (a, b) in f, b E B. 
When we write b — f(a), we mean that (a, b) belongs to the collection f. From 
here on, we proceed to handle functions exactly as before. 
A somewhat similar device enables us to give an explicit definition of the idea 
* of a relation defined on a set A. We have been using this idea somewhat informally, 
writing a < b to mean that a has the relation < to b, and, more generally, a * b 


is written to mean that a has the relation * to b. Now, given a relation *, defined 
on the set A, we can form the collection 


((a, b)|a * bj. 


Conversely, given any collection of ordered pairs of elements of A, we can define 
a relation *, by saying that a * b if the pair (a, b) belongs to the collection. In 
the following definition, we are saying that the relation 7s the collection. Recall, 
of course, that A X A is the set of all ordered pairs of elements of A. 


Derinition. A relation defined on a set A is a subset of A X A. 


For example, let 
A zm {l, 2, 3} , 
and let 


* = ((1, 2), (1, 3), 2, 3). 


Then * isa relation. (It is, in fact, the usual relation <.) 
It is not necessary, of course, to denote relations by peculiar symbols. For 
example, if A = (1,2, 3}, as before, we may let 


G = (Q, 1), (3, 1), (3, 2)}. 


Thus 261, 361 and 3G2, because (2, 1), (3, 1), and (3,2) belong to G. (In fact, 
G is the relation >.) 


Prosiem Ser 3.2 
1. let A = (1,2, 3, 4). Let 
G= {(4, 2), (4, 1), (4, 3), (2, 1), (2, 3), (1, 3)}. 


Is G a relation? Is G an order relation? 
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2. Let A be as before, and let G be the set of all ordered pairs (a, b) such that a and b 
belong to A and a # b. Is G a relation? Is G an order relation? 


3. Is the following collection a function? If so, what are its domain and image? 
((0, 0), (1, 1), (2, 4), (3, 2), (4, 2), (5, 4), (6, 1)} 


Can you see a systematic way in which this collection might have been constructed? 
4. Is the following collection a function? 


(0, 1), (1, 0), (0, 0)} 


5. Let f be the set of all ordered pairs (z, y) such that z and y belong to R and y = 2?. 
Is this a function? 

6. The same question is asked for the set of all ordered pairs (z, y) such that z and y 
belong to R and z = y?. 

1. Consider a rectangular coordinate system in the plane, in the usual sense of analytic 
geometry. Every point has a pair of coordinates (z, y). For the purposes of this question, 
let us regard points as indistinguishable from the ordered pairs (z, y) that describe them. 
Thus every figure, that is, every set of points, becomes a collection of ordered pairs of 
real numbers. Under what conditions, if any, do the following figures represent functions? 


(a) & triangle 

(b) a single point 

(c) a line 

(d) & circle 

(e) a semicircle, including the end points 
(f) an ellipse 


What, in general, is the geometric condition that a figure in the coordinate plane must 
satisfy, to be a function? 


3.3 THE DISTANCE FUNCTION 
So far, the structure dealt with in our geometry has been the triplet 
[S, £, €]. 


We shall now add to the structure by introducing the idea of distance. To each 
pair of points there will correspond a real number called the distance between them. 
Thus we want a distance function d, subject to the following postulates. 


D-0. d is a function 
d:SxS—R. 


D-1. For every P, Q, d (P,Q) 2 0. 
D-2. d(P,Q) = Oif and only if P = Q. 
D-3. d(P,Q) = d(Q, P) for every P and Q in S. 
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Here we have numbered our first postulate D-0 because it is never going to be 
cited in proofs; it merely explains what sort of object d is. Of course d(P, Q) will 
be called the distance between P and Q, and, for the sake of brevity, we shall 
write d(P, Q) simply as PQ. (We shall be using distances so often that we ought 
to reserve for them the simplest notation available, and “PQ” is absolute rock- 
bottom.) 


Surely any reasonable notion of distance ought to satisfy D-1 through D-3. 
We might have required also that í 


PQ+QR 2 PR, 


which would say, approximately, that “a straight line is the shortest distance - 
between two points.” But as it happens, we don’t need to make this statement 
a postulate, because it can be proved on the basis of other geometric postulates, 
to be stated later. 

Henceforth, until further notice, the distance function d is going to be part of 
our structure. Thus the structure, at the present stage, is 


IS, £, ®, d]. 


The distance function is connected up with the rest of the geometry by the ruler 
postulate D-4, which we shall state presently. 
We ordinarily think of the real numbers as being arranged on a line, like this: 


Figure 3.1 


If the “lines” in our geometry, that is, the elements of £, really “behave like lines,” 
then we ought to be able to apply the same process in reverse and label the points 
of any line L with numbers in the way that we label the points of the x-axis in | 
analytic geometry : 


T P R Q 
Xj -1 0 1 xy 
Figure 3.2 


If this is done in the usual way, then we have a one-to-one correspondence, 
f:LeR, 


between the points of L and the real numbers. This correspondence will turn 
out to be a coordinate system, in a sense which we shall soon define. Meanwhile, 
therefore, if x = f(P), we shall refer to x as the coordinate of P. Im the figure, - 
the coordinates of P, Q, R, and T are 0, xı, 1, and xz. If the coordinates are related 
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to distance in the usual way, then 
PO = jz] and PT = |r3l. 
In fact, no matter where Q and T may lie on the line, we will always have 
QT = |z2 — zıl. 


(You ean check this for the cases z < xı < 0,42 < 0 < tı, and 0 < T2 < Tı. 
There is no harm in assuming that z9 < 2x1, because when x, and vz are inter- 
changed, both sides of our equation are unchanged.) 

Obviously nothing can be proved by this discussion, because the postulates that 
we have stated so far do not describe any connection at all between the distance 
function and lines. All that we have been trying to do is to indicate why the 
following definition, and the following postulate, are reasonable. 


DEFINITION. Let 
fF:LeR 


be a one-to-one correspondence between a line L and the real numbers. If for 
all points P, Q of L, we have 


PQ = |f(P) — f], 


then f is a coordinate system for L. For each point P of L, the number z — f(P) 
is called the coordinate of P. 


D-4. The Ruler Postulate. Every line has a coordinate system. 


The postulate D-4 is called the ruler postulate because, in effect, it furnishes 
us with an infinite ruler which can be laid down on any line and used to measure 
distances along the line. This kind of ruler is not available in classical Euclidean 
geometry. When we speak of “ruler-and-compass constructions" in classical geom- 
etry, the first of these abstract drawing instruments is not really a ruler, because 
it has no marks on it. It is, properly speaking, merely a straight-edge. You can 
use it to draw the line containing two different points, but you can’t use it to 
measure distances with numbers or even to tell whether two distances PQ, RT 
are the same. 

As it stands, D-4 says merely that every line has at least one coordinate system. 
It is easy to show, however, that there are lots of others. 


Theorem 1. If f is a coordinate system for L, and 
g(P) = —f(P) 
for each point P of L, then g is a coordinate system for L. 


Proof. It is plain that the condition g(P) = — f(P) defines a function L — R. 
And this function is one to one, because if x = g(P), it follows that —z = f(P), 
and P = f-!(—), so that P is uniquely determined by z. 
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It remains to check the distance formula. Given that 
z= g(P), vw-9Q, 
PQ = |z — yl. 
—z —f(P, —y=f(Q). 
Since f is a coordinate system, it follows that 
PQ = |(72) — (-9)l. 


PQ = |y — z| 
= |z — yl. 


we want to prove that 


We know that 


Therefore 


which was to be proved. 

Theorem 1 amounts to a statement that if we reverse the direction of the co- 
ordinate system, then we get another coordinate system. We can also shift the 
coordinates to left or right. 


Theorem 2. Let f be a coordinate system for the line L. Let a be any real number; 
and for each P € L, let 


g(P) = f(P) + a. 
Then g: L — R is a coordinate system for L. 
The proof is very similar to that of the preceding theorem. Combining the 
two, we get the following theorem. 


Theorem 3. The Ruler Placement Theorem. Let L be a line, and let P and Q 
be any two points of L. Then L has a coordinate system in which the coordinate 
of P is 0 and the coordinate of Q is positive. 


Proof. Let f be any coordinate system for L. Let a — f(P); and for each point 
T of L, let g(T) — f(T) — a. 

Then g is a coordinate system for L; and g(P) = 0. If g(Q) > 0, then g is the 
system that we were looking for. If g(Q) < 0, let h(T) = —g(T) for every 
T € L. Then h satisfies the conditions of the theorem. 


PROBLEM SET 3.3 
1. Show that D-1, D-2, and D-3 are consequences of the ruler postulate. 
3.4 BETWEENNESS 
One of the simplest ideas in geometry is that of betweenness for points on a line. 


In fact, Euclid seems to have regarded it as too simple to analyze at all, and he 
uses it, without comment, in proofs, but doesn’t mention it at all in his postulates. 
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Roughly speaking, B is between A and C on the line L if the points are situated 
like this: 


A B c 
FiaunE 3.3 
or like this: 
c B A 
FIGURE 3.4 


(Logically speaking, of course, the second figure is superfluous, because on & line, 
there is no way to tell left from right or up from down.) What we need, to handle 
betweenness mathematically, is an exact definition which conveys our common- 
sense idea of what betweenness ought to mean. One such definition is as follows. 


Derinirion. Let A, B, and C be three collinear points. If 
AB+ BC = AC, 
then B is between A and C. In this case we write A-B-C. 


As we shall see, this definition is workable. It enables us to prove that between- 
ness has the properties that it ought to have. 


Theorem B-1. If A-B-C, then C-B-A. 

This is a triviality. If AB + BC = AC, then CB + BA = CA. 

The rest of the basic theorems on betweenness are going to depend essentially 
on the ruler postulate. 

Betweenness for real numbers is defined in the expected way; y is between z 
and z if either z < y < zorz < y < T. In this case we write z-y-z. (Confusion 
with subtraction is unlikely to occur, because *» minus y minus z" would be 
ambiguous anyway.) 

Lemma 1. Given a line L with a coordinate system f and three points A, B, 

C with coordinates z, y, 2, respectively. If z-y-z, then A-B-C. 


Proof of lemma. (1) Ife < y < 5 then 
4 — AB-ly-—:3—9—9 

because y — z > 0. Fór the same reasons, 
BC = |z — y| =z- y 


and 
AC = |2 — r| = 2 — z. 
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Therefore 
AB + BC = (y — x) + (2 — y) 
=z— t 
= |z — z| 
= AC 
so that A-B-C. 


(2) Ifz < y < 2, it follows by a precisely similar argument that C-B-A, which 
means that A-B-C, as before. 


Theorem B-2. Of any three points on a line exactly one is between the other - 
two. 


Proof. (1) Let f be a coordinate system for the line; and let x, y, z be the co- 
ordinates of the points A, B, C. One of the numbers z, y, z is between the other 
two. By Lemma 1, this means that the corresponding point A, B, or C is between 
the other two points. 

(2) We now need to prove that if A-B-C, then neither of the conditions B-A-C, 
A-C-B holds. If B-A-C, we have 


BA + AC = BC. 
But we have given that 
AB + BC = AC. 
By addition, we get 
BA + AC + AB + BC = BC+ AC, 
or 


2AB = 0. 


Therefore AB = 0. This is impossible, because A # B. 
The proof that both A-B-C and A-C-B cannot hold is precisely analogous. 
Consider now four points A, B, C, D of a line L. In the list below, we indicate 
the four possible triplets that can be formed from these; opposite each triplet we 
have listed the three possible betweenness relations. 


A,B,C: A-B-C, A-C-B, B-A-C, 

A, B,D: A-B-D, A-D-B, B-A-D, 

A C, D: A-C-D, A-D-C, C-A-D, 

BG, D: B-C:D, B-D-C, C-B- D. 

When we write 
A-B-C-D, 

we mean that all the overscored betweenness relations A-B-C, A-B-D, A-C-D and 
B-C-D hold, but none of the other eight relations hold. (Thus A-B-C-D is a rather 
efficient shorthand.) The scheme is, of course, easy to remember; the relations 


that hold are the ones that you can get by leaving out one of the letters in the 
expression A-B-C- D. 
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Theorem B-3. Any four points of a line can be named in an order A, B, C, D, 
in such a way that A-B-C-D. 


Proof. Let f be the line that contains our four points P, Q, R, S. The coordinates 
of our points are four numbers; and these appear in some order 


wz«rczyczz 
Here w, x, y, and z are f(P), f(Q), F(R), f(S), but not necessarily respectively. Let 
A-f'& B=f'@, c=r'w, D-f'Q 


From the double inequalities w <x < y, W <uczuwey<4r<y<% 
we get (by Lemma 1) the betweenness relations A-B-C, A-B-D, A-C-D, B-C-D. 
Thus, for each three of our four points, we have a betweenness relation; and 
Theorem B-2 tells us that every three points stand in only one betweenness relation. 
Therefore our list is complete, and A-B-C-D, which was to be proved. 


Theorem B-4. If A and B are any two points, then (1) there is a point C such 
that A-B-C, and (2) there is a point D such that A-D-B. 


Proof. Take a coordinate system f for the line AB that contains A and B. 


A D B c 
x x+y D ytl 
2 
FIGURE 3.5 


There is no loss of generality in supposing that x < y. (See Theorem 1, Sec- 
tion 3.3.) As in the figure, let 


C — fly. 
Then A-B-C because z < y < y t 


As in the figure again, let 
D= Jd (£22) " 


2r < z 4- y € y. 


Since x < y, we have 


(Why?) Therefore 
que IS « y, 


so that A-D-B. 
In the next few chapters, we shall want to handle betweenness by referring only 


. to the theorems of this section, without going back to the definition. (The reasons 
for this will be explained much later.) It turns out that the theorems above are 
adequate, if we include the following trivial one. 
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Theorem B-5. If A-B-C, then A, B, and C are three different points of the same 
line. 

This held, of course, under our original definition of the relation A-B-C. 

For convenience of reference, we list the basic properties of betweenness. 


B-1. If A-B-C, then C-B-A. 
B-2. Of any three points on a line, exactly one is between the other two. 


B-3. Any four points of a line can be named in an order A, B, C, D, in such a 
way that A-B-C-D. 


B-4. If A and B are any two points, then (1) there is a point C such that A-C-B, 
and (2) there is a point D such that A-B-D. 


B-5. If A-B-C, the A, B, and C are three different points of the same line. 


PROBLEM Ser 3.4 


1. Show that if A-B-C and B-C-D, then A-B-D and A-C-D. 

2. Show that if A-B-C and A-D-C, then A-B-D-C, A-D-B-C, or B — D. 

3. Given four spherical beads of different colors. In how many different ways is it 
possible to arrange them in a trough, in order from left to right. (This is & problem in 
order.) 

4. Given four beads as in Problem 3. In how many essentially different ways is it 
possible to arrange them on a rigid symmetrical rod. (This is a problem in betweenness.) 

5. Given four beads as in the preceding problems. In how many essentially different 
ways is it possible to arrange them on a string so as to make a four-bead necklace? (The 
String is so thin that the knot can slip through the holes in the beads. This is & problem 
in "betweenness on a circle," and the answer indicates that the idea of "betweenness on 
& circle" is much more peculiar than one might have supposed.) 


6. Prove the following converse of Lemma 1. 
lemma 2. Given a line L with a coordinate System f, and three points A, B, and C 
with coordinates z, y, and z, respectively. If A-B-C, then z-y-2. 


7. In this section, we defined a betweenness relation for real numbers, by saying that 


z-y- if either z < y < z or z < y € z. Show that, for this betweenness relation, 
Conditions B-1 through B-4 hold true. 


3.5 SEGMENTS, RAYS, ANGLES, AND TRIANGLES 


If A and B are two points, then the segment between A and B is the set whose 
points are A and B, together with all points between A and B. By B-5, the seg- 


ment lies on the line AB, and we have the figure below. As indicated, the segment 
is denoted by AB. 


Ficure 3.6 
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If A and B are two points, then the ray from A through B is the figure that 


looks like this: aa 
AB 


Figure 3.7 


As indicated, the ray is denoted by AB. 

More precisely, the ray AB is the set of all points C of the line AB such that 
A is not between C and B. The point A is called the end point of the ray AB. 

If this definition looks peculiar, you should check it against the figure to make 
sura that it agrees with our rough notion of what points of the line ought to be 
on the ray. It is fairly easy to see that AB is the union of (1) the segment AB, 
and (2) the set of all points C such that A-B-C. If this latter description seems 
more natural to you, you are welcome to regard it as the definition of a ray. 

Roughly speaking, an angle is a figure that looks like this: 


B 


c 


Figure 3.8 


More precisely, an angle is a figure which is the union of two rays which have the 
same end point, but do not lie on the same line. If the angle is the union of AB 
and AC, then these rays are called the sides of the angle; the point A is called the 
vertex; and the angle itself is denoted by the symbol : 


ZBAC. 


Notice that we always have ZBAC — ZCAB. 
Finally, if 4, B, and C are three noncollinear points, then the set 


AB u BC u AC 


is called a triangle. B 


AC 
Figure 3.9 


The three segments AB, BC, and AC are called its sides; and the points A, B, 
and C are called its vertices. (The English plural vertexes is used by some authors 
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but not others.) The triangle itself is denoted by the symbol 
AABC. 
The angles of A ABC are ZBAC, Z ACB, and Z ABC. Note that A ABC con- l 
tains none of these three angles, because the sides of an angle are rays and the sides 


of a triangle are segments (which are, to say the least, considerably shorter). If 
we drew in all the angles, the figure would look like this; 


B 


Figure 3.10 
The following theorems look quite easy, but some of them aren’t. 
Theorem 1. If A and B are any two points, then AB — BA. 
Theorem 2. If C is a point of AB, other than A, then AB = AC. 


Theorem 3. If B, and C 1 are points of AB and AC, other than A, then Z BAC = 
Z B,AC;. 


Figure 3.11 


Theorem 4. If AB = CD, then the 
D, in some order, (That is, the 
by the segment.) 


points A, B are the same as the points C, 
end points of a segment are uniquely determined 


Theorem 5. If AABC = ADEF, then the points A, B, and C are the same as 
the points D, E, and F, in some order. (That is, the vertices of a triangle are 
uniquely determined by the triangle.) 


If you review the definitions of AB, AB, ZBAC, and AABC, you will see 
that all of these definitions are based on the idea of betweenness. The proofs of 
Theorems 1 through 5 must, therefore, be based on Theorems B-1 through B-5. 
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A word of caution: Here and hereafter, the symbol = is going to be used in one 
and only one sense; it means “is exactly the same as.” Thus, when we write 
AB = BA, we mean that the sets AB and BA have exactly the same elements. 

Finally, a few remarks may be in order about the way in which we have defined 
the idea of an angle. Under our definition, an angle is simply a set which is the 
union of two noncollinear rays. Angles, in this simple sense, are quite adequate 
for the purposes of Euclidean geometry. 

Much later, in analytic geometry and in trigonometry, we shall need to talk 
about directed angles in which the initial side can be distinguished from the terminal 
side, like this: 

C C 


A A 
B 


(a) (b) 
Figure 3.12 


An angle, in this sense, is not a set of points, but rather an ordered pair (AB, AC) 
of rays; thus (AB, AC) is different from (AG, AB). For directed angles, we allow 
the possibility that the sides are collinear, and we also allow the possibility that 
the sides are the same. We have not used this more complicated idea of an angle, 
because at the present stage we have no use for it. For example, the angles of 
a triangle never consist of two collinear rays, and there is no natural way to 
assign directions to them. 


PROBLEM Ser 3.5 
1. Prove Theorem 1. 
2. Show that, given a ray AB, there is a coordinate system f on the line AB such that 


AB = (P|f(P) = 9). 
3. Prove Theorem 2. 
4. Prove Theorem 3. 

*5. Prove the following. Let A and B be two points, and let D, E, and F be three 
noncollinear points. If AB contains only one of the points D, E, or F, then each of the lines 
DE, DF, EF intersects AB in at most one point. 

*6. Prove the following. If AABC = ADEF, then each of the lines AB. BC, AC con- 
tains two of the points D, E, and F. 

*7. Show that for any A ABC, we have 

ABA AABC = AB. 
That is, the only points of AB that lie on the triangle are the points of the side AB. 

*8. Prove the following. If AABC = ADEF, then each side of A ABC contains two 
of the points D, E, and F. 

*9. Prove Theorem 5. 
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3.6 CONGRUENCE OF SEGMENTS 


The intuitive idea of congruence, for any two figures at all, is always the same. 
Two figures F and G are congruent if one can be moved so as to coincide with the 
other. Thus two equilateral triangles of the same size are always congruent; 
two circles of the same radius are always congruent; two squares of the same size 
are always congruent, and so on. 


EMO 


Figure 3.13 


In the same way, two segments of the same length are always congruent. 


oe 
A B e D 
FiaunE 3.14 


Here, by the length of a segment, we mean the distance between its end points. 

Our problem, in our mathematical study of congruence, is to formulate the idea 
in sufficiently exact form to be able to prove things about it. In the present sec- 
tion, we shall do this for the case in which the figures are segments. (This is the 
easiest case of all.) Later we shall do the same for the case in which the figures are 
angles; and still later, we shall discuss triangles. Finally, in the chapter on rigid 
motion, we shall discuss congruence in a form sufficiently general to apply to any 
two sets of points. 

We start with our official definition. 


DzrixirioN. Let AB and CD be segments. If AB = CD, then the segments are 
called congruent, and we write AB = CD. 


On the basis of this definition, it is easy to prove the familiar and fairly trivial 
facts about congruence of segments. 


A relation ~, defined on a set A, is called an equivalence relation if the following 
conditions hold. 


(1) Reflexity. a — a, for every a. 
(2) Symmetry. If a — b, then b — a. 
(3) Transitivity. If a ~ b and b ~ c, then a ~ c. 


Theorem C-1. l'or segments, congruence is an equivalence relation. 


_ That is, every segment is congruent to itself; if AB & CD, then CD = AB; if 
AB = CD and CD z EF, then AB & EF. 


Proof? There isn't much to it. 


CONGRUENCE OF SEGMENTS 59 


Theorem C-2. The Segment-Construction Theorem. Given a segment AB and 
a ray CD. There is exactly one point E of CD such that AB = CE. 


E 
oo 2 
A B C 
AB=CE 
Figure 3.15 


That is, starting at the end point of a ray, you can measure off a segment of 
any desired length, and the resulting segment is unique. 
_Proof. By the ruler placement theorem, set up a coordinate system f for the line 
CD, in such a way that f(C) = 0 and f(D) > 0. 


Cc D E 
o CD x=AB 
Fieure 3.16 


In the figure, we have indicated that the number CD is the coordinate of the point 
D, and this is correct, because f(D) > 0. If E is a point of CD, then CE = AB 
if and only if f(E) = AB as in the figure. Thus CE œ AB if and only if E = 
f^ (AB). There is exactly one such point f~ 1( 4B), and therefore there is exactly 


one such point E. 
The following theorem says, in effect, that if congruent segments are laid end 


to end, the resulting segments are congruent. 


L . 
| Mer card es co UTR 
A' B Cc L 
B 
A 


Figure 3.17 


Theorem C-3. The Segment-Addition Theorem. If 


(1) A-B-C, 

(2) A'-B'-C', 
(3) AB = AP, 
and : 
(4) BC = BC’, 
then 


(5) ACS AT’. 


We also have a converse. 
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Theorem C-4. The Segment-Subtraction Theorem. If (1) A-B-C, (2) -A'-B'-C'; 
(3) AB = A'B', and (4) AC = AC’, then (5) BC & BU. 


These theorems can most conveniently be proved by means of the definition of 
betweenness. You should work out the proofs in full. 

Note that we have called Theorem C-4 a converse of Theorem C-3, rather than 
the converse of Theorem C-3. The reason is that most theorems have more than one 
converse (each of which, of course, may or may not be true). For statements of 
the form P >Q, where P and Q are propositions, the situation is simple. The 
converse of the implication P > Q is the implication Q> P. A theorem may, 
however, be stated in the following form: “If (a), (b), and (c), then (d), (e), and 
(f).” This says that 


[(a) and (b) and (c)] 5 [(d) and (e) and (f)]. 


In this case, any statement that you get by interchanging part of the hypothesis 
and part of the conclusion is called a converse. Thus, to get Theorem C-4 from 
Theorem C-3, we moved (5) into the hypothesis, and moved (4) into the conclu- 
sion. Theorem C-3 has three more converses, You should state them and find 
out which of them are true. 

If A-B-C, and AB = BC, then B is a midpoint of AC. The following theorem 
justifies us in referring to B as the midpoint. 


Theorem C-5. Every segment has exactly one midpoint. 


Proof. Given ; AC. By the ruler placement theorem, take a coordinate system 
f, for the line AC, such that f(A) = 0 and f(C) > 0. 


-4————————— ————————M 
oO x AC 
Figure 3.18 


If B is between A and C, then 


AB —|r—0|—z 
and 
BC = |AC — z| = AC — x. 


Thus, for the case where A-B-C, the condition AB = BC is equivalent to the 
condition 


t= AC —z, 
or 
2x = AC, 
or 
AC 
g = —. 
2 


There is exactly one such number z, and therefore there is exactly one such point B. 


CHAPTER A 


SEPARATION IN PLANES AND SPACE 


4.1 CONVEXITY AND SEPARATION 


_A set A is called convex if for every two points P, Q of A, the entire segment 
PQ lies in A. For example, the three figures below are convex: 


ANC» @ 


Figure 4.1 


Here each of the sets A, B, and C is a region in the plane; for example, A is the 
union of a triangle and the set of all points that lie inside the triangle. We have 
illustrated the convexity of the sets A, B, and C by drawing in some of the seg- 
ments PQ. 

On the other hand, none of the sets D, E, and F below are convex: 


FIGURE 4.2 


To show that a set, say D, is not convex, you have to show that there are two 
points P and Q, both belonging to D, such that PQ does not lie in D. This is what 
we have indicated, for each of our last three figures. 
A convex set may be very "thin and small." For example, every segment PQ 
is a convex set. In fact, a set with only one point is convex. (Since such a set does 
61 
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not contain any two points, it follows that every two points of it have any propert: 
we feel like mentioning.) | 

A convex set may also be very large. For example, the whole space S is a convex 
set; and all lines and planes are convex. (Proof?) Given a line L ina plane Æ, the 
parts of E that lie on the two sides of L are both convex. 


& -----\.-----+4 


FIGURE 4.3 


In Fig. 4.3, H, is the part of the plane lying above and to the left of the line L, 
and Ho is the part of the plane that lies below and to the right of L. The sets 


PS-1. The Plane-Separation Postulate. Given a line and a plane containing it, 
the set of all points of the plane that do not lie on the line is the union of two 
sets such that (1) each of the Sets is convex, and (2) if P belongs to one of the sets 
and Q belongs to the other, then the segment PQ intersects the line. 


We can now begin to state official definitions based on our postulates. If Æ and 
L are a plane and a line, as in the postulates, and 77 1 and H3 are the two sets given 
by the postulate, then each of the sets H, and H 2 is called a half plane, and L is 
called the edge of each of them. 

Obviously there is no natural way to decide which of the half planes should be 
mentioned first, but except for this question of order, the two half planes are 
uniquely determined by E and L. To see this, we observe that if P € H 1, then 
the points of H, are the point P and the points Q such that 


PQnL- g. 


Similarly, 
Hi: = (QQ€ E — Land POOL # p. 
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PROBLEM SET 4.1 


Prove the following theorems. In all these theorems, it should be understood that 
E, L, Hi, and H2 are a plane, a line, and the two half spaces given by PS-1. The proofs 
are of a sort that may not be at all familiar to you. Obviously, PS-1 uses the idea of a 
segment, and segments were defined in terms of betweenness. Therefore the chances are 
that you will have to appeal to a fair number of postulates (and theorems) other than 
PS-1. Use of the ruler postulate or the ruler placement theorem is not allowed; the proofs 
should use, instead, the theorems B-1 through B-5 which were based on them.* 

1. Theorem A. The sets Hı and H» are not both empty. 

2. Theorem B. Neither of the sets Hı and H2 is empty. 

3. Theorem C. Hi; contains at least two points. 

4. Theorem D. Hı contains at least three noncollinear points. 

5. Theorem E. E is uniquely determined by H;. That is, every half plane lies in. 
only one plane. 

6. Theorem F. Lis uniquely determined by Hı. That is, every half plane has only 
one edge. 

7. Theorem G. Every ray is convex. 

8. Theorem H. Hı U L is convex. 

9. Theorem |. If A and B are convex, then so also is A N B. 

10. Theorem J. If G is any collection of convex sets g;, then the intersection of all of 

the sets g; in the collection is convex. 

The convex hull of a set A is the intersection of all convex sets that contain A. 

1l. Theorem K. If A is any set of points, then the convex hull of A is convex. 

12. Let A be a set of points. Let B be the union of all segments of the form PQ, 
where P and Q belong to A. Does it follow that B is convex? Why or why not? 

13. Theorem L. Given a triangle ABC, and a line L in the same plane. If L contains 

no vertex of the triangle, then L cannot intersect all of the three sides. 


A 


FIGURE 4.4 


14. Theorem M. The Postulate of Pasch. Given a triangle ABC, and a line L in the 

same plane. If L contains a point between A and B, then L also intersects one of the 

other two sides of the triangle. 

(We call this a postulate because in the work of Pasch it was used as such, in place of 
PS-1 above.) 

* There is one exception. In proving Theorem G, you may use the result of Problem 2 
of Problem Set 3.5. 
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4.2 INCIDENCE THEOREMS BASED ON THE PLANE-SEPARATION POSTULATE 
If E — L= Hı U Hs, 


as in the plane-separation postulate, then we say that the sets Hı and Ho are 
half planes of L or sides of L. Notice that every line has two sides in every plane 
that contains it, but if P and Q are on the same side of a line L, this automatically 
means that L, P, and Q are coplanar. On the other hand, to say that P and Q are 
on different sides of L, in space, may mean merely that no one plane contains L, 
P, and Q. If E — L = H, U Hs», as in the plane-separation postulate, then H, 
and Ho are called opposite sides of L; and if P belongs to H, and Q belongs to 
Hp», we say that P and Q are on opposite sides of L. 
The following two theorems are easy. 


Theorem 1. If P and Q are on opposite sides of the line L, and Q and T' are 
on opposite sides of L, then P and T are on the same side of L. 


Theorem 2. If P and Q are on opposite sides of the line L, and Q and T are on 
the same side of L, then P and T are on opposite sides of L. 


We use a similar terminology for the “sides of a point" on a line. That is, if 
A-B-C, then the rays BÁ and BÓ are called opposite rays. 


BA BC 
r — tes ret eee 
A B C 


Figure 4.5 


Theorem 3. Given a line, and a ray which has its end point on the line but does 
not lie on the line. Then all points of the ray, except for the end point, are on 
the same side of the line. 


Proof. Let L be the line, and let AB be the ray, with A € L. 


P Lg 
uu DP 
Figure 4.6 


Suppose that AB contains a point C such that B and C are on opposite sides of L 
(in the plane that contains Land AB). Then BC intersects L in some point, and this 
point must be A, because BC lies in AB, and AB intersects L only in A. Therefore 
C-A-B. But this is impossible. By definition, the ray AB is the set of all points 
C of the line AB for which it is not true that C-A-B, Therefore all points of the ray, 
other than A, are on the same side of L, namely, the side that contains B. 
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Given ZBAC. 


BY Interior 


D 


Exterior 


^l 
Exterior 


Figure 4.7 


Roughly speaking, the interior of the angle is the set of all points that lie inside it, 
and the exterior is the set of all points that lie outside it. We can make this idea 


precise in the following way. 

The interior of Z.BAC is the intersection of the side of AC that contains B, 
and the side of AB that contains C. Thus a point D lies in the interior (1) if D 
and B are on the same side of AC, and (2) if D and C are on the same side of AB 

For this definition to be valid, it has to depend only on the angle that we started 
with, and not on the points B and C that we happened to choose to describe the 
angle. Thus, in the figure below, it would be sad indeed if our definition gave us 
two different interiors for 2B’ AC’ and ZBAC: 


B 
B 
A 
LEA, ad 


Ficure 4.8 


The preceding theorem shows, however, that our definition really does depend 
only on the angle, because B and B' are always on the same side of AC, and C 


and C’ are always on the same side of AB. 
Given an angle Z ABC, there is exactly one plane E that contains it. The 
exterior of the angle is the set of all points of E that lie neither on the angle nor in 


its interior. 


Theorem 4. Every side of a triangle lies, except for its end points, in the interior 


of the opposite angle. 
Here we are using the ordinary terminology; that is, in A ABC, the angle ZA = 


Z.BAC is opposite the side BC. 


B 
D 
A 
[^j 


Figure 4.9 
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Proof. Let D be a point of BC, between B and C. Then C is not between B and D. 
(Why?) Therefore BD does not contain C. "Therefore BD does ; not contain any” 
point at all of the line AC, because BD lies in BC, and BC and AC intersect only 
at C. Therefore, (1) D and B are on the same side of AC. In a precisely similar 
way, it follows that (2) D and C are on the same side of AB. These two conditions 
tell us that D is in the interior of Z BAC, which was to be proved. 


Theorem 5. If A-C-D, B-F-C, and A-F-G (all in the same plane), and A, B, and 


C are not collinear, then G is in the interior of ZBCD. 
B 


Gy) B 
Ficure 4.10 


Proof. (1) Since B-F-C, it follows that BF does not intersect AC. Therefore 
B and F are on the same side of AC. zs 

(2) Since A-F-G, it follows that FG does not intersect AC. Therefore F and G 
are on the same side of AC. on 

(3) By (1) and (2), G is on the side of AC that contains B. 

(4) Since AC = CD, it follows that G is on the side of CD that contains B. 

Thus we have proved half of our theorem. re 

(5) Since A-C-D, A and D are on opposite sides of BC. 

(6) Since A-F-G, A and G are on opposite sides of BC. 

(7) Therefore, by Theorem 1, G is on the side of BC that contains D. 

Steps (4) and (7) tell us that G is in the interior of 2 BCD, which was to be proved. 

Throughout this section, we have been using figures to help us keep track of 
what is going on. (Here us, of course, includes the author. All authors use figures, 
whether or not they decide to show these to the reader.) You should watch very 
carefully, however, to be sure that the figures are playing merely their legitimate 
part as memoranda. It is customary, in elementary texts, for the reader to be 
assured that “the proofs do not depend on the figure,” but these promises are 
almost never kept. (Whether such promises ought to be kept, in an elementary 
course, is another question, and the answer should probably be “No.”) In a 
mathematically thorough treatment, however, the hypothesis and conclusion ought 
to be stated so fully and explicitly that no figure is actually necessary to make 
them plain; and in the same way, the proofs ought to rest on the postulates and the 
previous theorems. This point is especially relevant in the present context, because 
in most informal treatments of geometry it is customary to convey betweenness 
relations and separation properties only by figures, without ever mentioning them 
in words at all. 


You may be able to remember a situation, in elementary geometry, where 
Theorem 5 is needed. 
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The interior of A ABC is defined as the A 
intersection of the following three sets: Z 
: T5 : LEO 
(1) The side of AB that contains C. CERIO, 
(2) The side of AC that contains B. BREEZE Is c 
(3) The side of BÓ that contains A. Figure 4.11 


Theorem 6. The interior of a triangle is always a convex set. 
Proof? 


Theorem 7. The interior of a triangle is the intersection of the interiors of its 
angles. 
Proof? 


4.3 INCIDENCE THEOREMS CONTINUED 
In the figure below, D is supposed to be in the interior of ZBAC. 


FIGURE 4.12 


It surely looks as if AD ought to intersect BC. We ought to be able to prove this, on 
the basis of our postulates; and in fact, if we were unable to do this, after suitably 
vigorous effects, this would give us reason to suspect that our postulates were not 
a really adequate description of the aspects of geometry that they purport to cover. 
In this section, we give a proof. But it is rather surprisingly long; and a number 
of preliminary results had better come first. 

In the figure below, it looks as if all of the ray AD, except for A, lies in the interior 
of ZBAC. And this is true: 


B 
D 
A 
C. 
FIGURE 4.13 


Theorem 1. If D lies in the interior of ZBAC, then AD — A lies in the interior 
of ZBAC. 
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Proof. (1) By hypothesis, D | D lies on the side-of AC that contains B. By Theorem 
3, Section 4.2, all points of AD — A lie on the side of AG that contains B. 

(2) For the same reasons, all points of AD — A lie on the side of AB that contains 
C. 

(1) and (2) tell us that AD — A lies in the interior of Z BAC, which was to be - 
proved. 


Theorem 2. If D lies in the interior of ZBAC, and G-A-D, then AG — A lies + 
in the side of AC that does not contain B. 


Figure 4.14 


Proof. Since G-A-D, G and D a are on opposite sides of AC. By hypothesis, 
B and D are on the same side of AC. By Theorem 2, Section 4.2, it follows that 
G and B are on opposite sides of AC. But, by Theorem 3, Section 4.2, all points 
of AG — z4 are on the same side of AC. Therefore all points of AG — Aare on the 
side of AC that does not contain B. 


In the figure below, it looks as if F and B are on the same side of AD. 


Figure 4.15 


Theorem 3. If D is in the interior of Z BAC, and F-A-C, then F and B are on 
the same side of AD. 


Our conclusion means that FB does not intersect the line AD. This is rather 
surprisingly hard to prove. In fact, we have to break our problem up into two parts, — 
in the following way. Let G be a point such that G-A-D, as indicated in the figure. 
Then AD and AG are opposite rays, so that 


AD = AD u AG. 
We shall prove that 
(I) FB does not intersect AD, and 
(II) FB does not intersect AG. 
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It will follow, of course, that FB does not intersect AD. In detail, the proof 
goes like this: 

(1) F and C are on opposite sides of AB (because F-A-C). 

(2) All points of BF — B are on the side of AB that does not contain C. (by 
Theorem 3 of Section 4.2). 

(3) FB contains no point of the interior of ZBAC. (Any such point would lie 
in BF, and BF contains no point of the interior. It lies on the “wrong side" of AB.) 

(4) FB contains no point of AD. (FB does not contain A; and all the rest of 
AD lies in the interior of ZBAC.) Thus, (I) is true. 

(5) FB — F lies on the side of AC that contains B (by Theorem 3, Section 4.2.) 

(6) FB — F lies on the side of AC that contains B (because FB lies in FB). 

(7) AG — A lies on the side of AC that does not contain B (by Theorem 2). 

(8) FB contains no point of AG. (FB does not contain A, and AG does not con- 
tain F, and by steps (6) and (8), FB — B and AG — A lie on opposite side of AC.) 
Thus (II) is also true; and the theorem follows. 

After all these preparations, we can prove the trivial-looking theorem that we 
were working toward. k 


Theorem 4. The Crossbar Theorem. If D is in the interior of Z BAC, then AD 
intersects BC. 


Figure 4.16 


Proof. Suppose that AD does not intersect BC. Then (1) B and C are on the 
same side of AD. Let F be a point such that F-A-C. By Theorem 3, we have 
(2) F and B on the same side of AD. (3) Therefore, F and C are on the same side 
of AD. But (3) is impossible, because F-A-C. 

You may remember that one of the first theorems in plane geometry that most 
people learn is the one dealing with the base angles of an isosceles triangle. These 
are always “equal,” that is, congruent in the sense in which we shall define the 
latter term later in this book. That is, if AB œ AC, then ZB = ZC. 


A 


D 
Figure 4.17 
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Although a good proof of the theorem was known in antiquity, it has become cus- 
tomary in later centuries to prove it in needlessly complicated ways; and probably: 
the worst of these rambling detours is the proof that starts by telling you (1) to 
bisect ZBAC, (2) to "let" D be the point where the bisecting ray AF intersects 
the base, and (3) to show that AADB and AADC are congruent. 

Of course, a light-hearted use of the word let is no substitute for a proof that 
AF intersects BC. This method of proof thus depends essentially on the crossbar 
theorem. We shall see, however, that for the simple theorem that we have been 
discussing, the whole issue of the crossbar theorem is easy to bypass; we don’t 
need a tractor to pull a kiddie car. 


PROBLEM Ser 4.3 


1. Theorem 5. Given a triangle, and a line lying in the same plane. If the line inter- 
sects the interior of the triangle, then it intersects at least one of the sides. 


4.4 CONVEX QUADRILATERALS 


Given four points A, B, C, and D such that they all lie in the same plane, but 
no three are collinear. If the segments AB, BC, CD, and DA intersect only at 
their end points, then their union is called a quadrilateral, and is denoted by 
[]ABCD. This notation is not meant to suggest that every quadrilateral is a 
square, any more than the analogous notation AA BC ismeant to suggest that every 
triangle is equilateral. 

The angles A, B, C, D of [.]ABCD are ZDAB, ZABC, and so on. 


4 A 


c 


Figure 4.18 


The sides of [ABCD are AB, BC, and so on. Two sides which have a common 
end point are called adjacent; two sides which are not adjacent are opposite. Two 
angles of a quadrilateral are adjacent if their intersection contains a side; and two 
angles which are not adjacent are opposite. 

The diagonals of [JABC D are the segments AC and BD. 

A quadrilateral is called convez if each of its sides lies in one of the half planes 
determined by the opposite side. Note that if A and B are on the same side of 
CD, then all points of AB are on the same side of CD. (The converse is trivial.) 
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Thus (JABCD is a convex quadrilateral if and only if all four of the following 
conditions hold. 

(1) A and B are on the same side of CD. 

(2) B and C are on the same side of DA. 

(3) C and D are on the same side of AB. 

(4) D and A are on the same side of BC. 

Note that the use of the word convex in geometry is inconsistent; no quadrilateral 
can possibly form a convex set in the sense in which the latter term was defined 
in Section 4.1. This usage is universal, however. 

The following theorem is very widely known, but does not usually get proved. 


Theorem 1. The diagonals of a convex quadrilateral always intersect each other. 


Proof. Let ABCD be a convex quadrilateral. We need to show that AC 
intersects BD. 


Ficure 4.19 


By conditions (1) and (2) above, we conclude that B is in the interior of ZA DC. 
Therefore, by the crossbar theorem, DB intersects AC at a point P. 

Similarly, by Conditions (3) and (4) we conclude that A is in the interior of 
ZBCD. Therefore, by the crossbar theorem, CA intersects BD at a point Q. 

Since each of these rays and segments lies in the corresponding line, it follows that 
DB intersects AC at P and also at Q. Therefore P = Q. Since P lies on AC and 
Q lies on BD, it follows that AC and BD have a point in common, which was to 
be proved. 

If you review this chapter and observe how much of it is needed for the proof 
of the above theorem, it will be obvious to you why the proof is commonly omitted 
in elementary treatments. 


4.5 SEPARATION OF SPACE BY PLANES 


The behavior of planes in space, with regard to separation properties, is very 
elosely analogous to the behavior of lines in a plane. We therefore merely state 
the basic postulate, definitions, and theorems, and leave the verifications to the 
reader. 
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SS-1. The Space-Separation Postulate. Given a plane in space. The set of all 
points that do not lie in the plane is the union of two sets Hı, Hə such that 
(1) each of the sets is convex, and (2) if P belongs to one of the sets and Q be- 
longs to the other, then the segment PQ intersects the plane. 


The two sets Hı, Hz described in SS-1 are called half spaces, or sides of the plane 
E, and E is called the face of each of them. As in the case of half planes, there is 
no natural way to decide which of them should be mentioned first; but except for 
order, the sets H; and H» are uniquely determined by E. The reason is that if 
P € H;, then 

H,—Pu(QQeS and PQNE = 0j, 
and 


H: = QIQES—E and PQNE s 0. 


'The following theorems are merely the appropriate revisions of some of the 
theorems in Section 4.1. 


Theorem 1. The sets H; and H are not both empty. 
Theorem 2. Neither of the sets Hı, H2 is empty. 
Theorem 3. Each of the sets Hı and H contains four noncoplanar points. 


Theorem 4. E is uniquely determined by H,. That is, every half space has only 
one face. 


A dihedral angle is a figure that looks like this: 


Figure 4.20 


More precisely, if two half planes H, and H3 have the same edge L, but do not 
lie in the same plane, then the set H, U H5 U L is called a dihedral angle. The 
line L is called the edge of the dihedral angle, and the sets H, U L and H3 U L are 
called its sides. (Note that just as the sides of an angle contain their common 
end point, so the sides of a dihedral angle contain their common edge.) 

The following theorem is analogous to Theorem 3, Section 4.2. 


Theorem 5. Let H be a half plane with edge L, and let E be a plane which 
contains L but not H. Then all points of H are on the same side of E. 
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Given a dihedral angle 
D = H, U H: UL. 


Let E, and E; be the planes that contain Hı and Ho, respectively. Then the 
interior of D is the intersection of (1) the side of E, that contains H2 and (2) the 
side of E3 that contains H1. 


Theorem 6. The interior of a dihedral angle is always a convex set. 


Theorem 7. If P and Q are in different sides of a dihedral angle, then every point 
between P and Q is in the interior of the dihedral angle. 


This section, of course, is hardly more than an introduction to the following 
problems. 


PROBLEM SET 4.5 


1. Prove Theorems 1 through 7. 


CHAPTER 5 


ANGULAR MEASURE 


5.1 DEGREE MEASURE FOR ANGLES 


You will recall that when we started doing geometry, we began with the 


structure 
[S, £, €]. 


Later we included in the structure the distance function 


d:SxS—R. 
This gave us the structure 
[S, £, @, d]. 


In terms of distances between points, we defined betweenness, and also congruence 
for segments. 

We now complete the structure by introducing measure for angles. This will 
turn out to be the familiar degree measure. The situation here is closely analogous 
to that for distance. We could equally well use radians, or any constant multiple 
of degrees or of radians; but since all of these measures for angles behave in es- 
sentially the same way, we may as well simplify the discussion by choosing one 
of them, once for all, and using it consistently thereafter. (Not until we get to 
calculus is there a good reason for preferring radian measure; and the transition to 
it is trivial.) 

Angular measure is going to be a function m, defined for angles, with real num- 
bers as values of the function. For convenience let @ be the set of all angles. 
Then we shall have the structure 


[S, £, 9, d, m], 
where 
m:G —R 


is a function of the angles into the real numbxrs. In the usual functional notation, 
we would write 
m(Z ABC) 


to denote the measure of Z ABC, but since no confusion with multiplication could 
possibly occur, we omit the parentheses and write merely 


mZABC. 
74 
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Since we shall be talking about only one measure function for angles, we can 
write merely 
mZABC = 90, mZDBC = 45. 


Cc 


Figure 5.1 


We do not write mZ ABC = 90°, because the values of the function m are simply 
real numbers; they stand alone and don’t need to carry little flags to indicate 
where they came from. On the other hand, in labeling figures, it is convenient to 
use the degree sign merely to indicate that certain letters or numbers are meant 
to be the degree measures of angles. 


R 
Figure 5.2 
The figures above tell us that 
mZABC = 30, 
and 
mZPQR = r. 


The postulates governing the function m are merely abstract descriptions of the. 
familiar behavior of protractors. With a protractor placed with its edge on the edge 
of the half plane H, as in the figure below, we can read off the measures of a large 
number of angles. 


Ficure 5.3 
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For example, 


mZPAB = 10, 
mZQAB = 40, 
mZRAB = 75. 


By subtraction, we also get 


mZQAP = 40 — 10 = 30, 

mZSAR = 90 — 75 = 15, 

mZCAU = 180 — 130 = 50, 
and so on. These and other uses of a protractor are reflected in the following 
postulates. 


M-1. m ıs à function @ — R, where G is the set of all angles, and R is the set 
of all real numbers. 
M-2. For every angle A, mZA is between 0 and 180. 


M-3. The Angle-Construction Postulate. Let AB bea ray on the edge of the half 
plane H. For every number r between 0 and 180, there is exactly one ray AP, 
with P in H, such that mZ PAB = r. 


FIGURE 5.4 


M-4. The Angle-Addition Postulate. If D is in the interior of ZBAC, then 
mZBAC = mZBAD + mZDAC. 


c 
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Figure 5.5 


(This, of course, is the property of m that we use when we compute the measure 
of an angle by subtraction.) 


DEGREE MEASURE FOR ANGLES 77 


Two angles form a linear pair if they look like this: 


D 


C A B 
Ficure 5.6 


That is, if AB and AG are opposite rays, and AD is any third ray, then ZDAB 
and Z DAC form a linear pair. IfmZABC + mZDEF = 180, then the two angles 
are called supplementary. Notice that this definition says nothing at all about 
where the angles are; it deals only with their measures. 


M-5. The Supplement Postulate. If two angles form a linear pair, then they are 
supplementary. 


r+s=180 
FIGURE 5.7 


Just as we defined congruence for segments in terms of distance, so we define 
congruence for angles in terms of measure. That is, if 


mZABC = mZ DEF, 


then the angles are congruent, and we write 
ZABC = ZDEF. 


If the angles in a linear pair are congruent, then each of them is called a right 
angle, If Z ABC is a right angle, and mZ ABC = r, then, of course, we have 
r = 90; the reason is that since the angles in a linear pair are always supplementary, 
we must have r + r = 180. The converse is also true (and easy). Thus an angle 
is a right angle if and only if its measure is 90. 

The following theorems are closely analogous to our first few theorems on con- 
gruence for segments. To see the analogy, we observe that a sort of “betweenness” 
relation can be defined for rays with the same end point; we might say that AD 
is “between” AB and AC if AD — A lies in the interior of ZBAC. 


B 


A e 
Ficure 5.8 
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The analogy is incomplete, because it is decidedly not true that given any three 
rays with the same end point, one of them is between the other two. (Example?) 
We do have, however, the following theorems. 


Theorem 1. For angles, congruence is an equivalence relation. 


Theorem 2. The Angle-Construction Theorem. Let Z ABC be an angle, let B'C' 
be a ray, and let H be a half plane whose edge contains B'C'. Then there is 
exactly one ray B'A', with A’ in H, such that 


ZABC = Z A'B'C'. 


Theorem 3. The Angle- Addition Theorem. If (1) D is in the interior of ZBAC, 
(2) D’ is in the interior of Z B'A'C", (3) ZBAD Œ Z B'A'D', and (4) ZDAC = 
Z.D' A'C', then (5) ZBAC = Z B'A'C'. 


Figure 5.9 


Theorem 4. If (1) D is in the interior of ZBAC, (2) D' is in the interior of 

ZB'A'C', (3) ZBAD = ZB'A'D', and (4) ZBAC = Z.B'A'C', then (5) 

ZDAC = ZD'A'C'. 

If we translate these theorems into the language of angular measure, using our 
definition of congruence for angles, they are seen to be trivial consequences of the 
postulates for m. 


Two rays are called perpendicular if their union is a right angle. If AB and AC 
are perpendicular, then we write 
AB L AC. 


In this case we also say that the lines AB and AC are perpendicular, and write 
AB L AC. 


Two segments AB, BC are perpendicular if the lines containing them are per- 
pendieular. We use the same term and the same notation for a segment and a 
line, a line and a ray, and so on. Thus AB L PQ means that AB L PQ; this in 
turn means that the union of the two lines contains a right angle. 

An angle with measure less than 90 is called acute, and an angle with measure 


greater than 90 is called obtuse. Two angles are called complementary if the sum of 
their measures is 90 
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If mZBAC < mZ B'A'C', then we say that ZBAC is smaller than ZB’A'C', 
and we write 
ZBAC < ZB'A'C'. 


Note that the relation “is smaller than” is not an order relation; it is quite possible 
for two angles to be different, without either of them being smaller than the 
other. In fact, this happens whenever the angles are congruent. 

Two angles form a vertical pair if their sides form pairs of opposite rays like this: 


E 
Fieure 5.10 


Here ZBAC and 2B’ AC’ form na vertical pair. To be more precise, if B-A-C’, 
C-A-B', and the lines AB and AB’ are different, then Z.BAC and 2B’ AC’ form 
a vertical pair. 


Theorem 5. The Vertical Angle Theorem. If two angles form a vertical pair, 
then they are congruent. 


Restatement. If B-A-C’, C-A-B’, and the lines AB and A’B’ are different, then 
ZBAC = ZB'AC'. 


Fieure 5.11 


Proof. Let 
r= mZ BAB’. 


Since B-A-C’, it follows that AB and AC’ are opposite rays. Therefore Z.BAB' 
and Z.B'AC' form a linear pair. Therefore these angles are supplementary, and 


we have 
mZ B'AC' = 180 — r. 


Similarly, C-A-B', ZBAC, and ZBAB' form a linear pair; these angles are 
therefore supplementary; and we have 


mZBAC = 180 — r. 
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Therefore mZBAC = mZ.B'AC', and ZBAC = ZB'AC', which was to be 
proved. 

If you suspect that the apparatus with which we stated the theorem and worked 
out the proof was superfluous, try “proving” the theorem while gazing at the 
following figure: 


Figure 5.12 


The point is that to prove anything at all about vertical angles, we have to have a 
definition of a vertical pair that is sufficiently exact to give us a hand hold. 


Theorem 6. If two intersecting lines form one right angle, then they form 
four right angles. 


Proof? 


CHAPTER 6 


CONGRUENCES BETWEEN TRIANGLES 


6.1 THE IDEA OF A CONGRUENCE 


As we explained in Chapter 3, the intuitive idea of congruence is the same for 
all types of figures. It means in every case that the first figure can be moved 
without changing its size or shape, so as to coincide with the second figure. There 
are two possible approaches to the problem of treating the idea of congruence 
mathematically. One way is to take it as undefined, state enough postulates to 
describe its essential properties, and then go on to prove whatever theorems turn 
out to be true. In later chapters, we shall show how such a postulational treatment 
works, For the present, however, we shall use a different scheme: we shall define 
congruence, in terms of distance and angular measure, and then proceed to prove 
our theorems on the basis of only one additional postulate. 

The first of these approaches to congruence is called the synthetic approach, 
and the second, which we shall be using for some time to come, is called the metric 
method, We have already applied the metric method to the simplest cases, where 
the figures we are dealing with are segments or angles. Our basic definitions were 
as follows. 


(1) AB = CD means, by definition, that AB = CD. 
(2) By definition, ZBAC = Z PQR means that mZ BAC = mZ PQR. 


We now proceed to the case where the figures involved are triangles. It is obvious, 
in the figures below, that all three of the triangles are congruent. 


F H 


FIGURE 6.1 


'That is, any one of them can be moved onto any other one in such a way that it 

fits exactly. Thus, to move the first onto the second, we should put 4 on D, B 

on F, and C on E. These directions describe the motion by means of a one-to-one 
81 
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correspondence between the vertices of the first triangle and those of the second: 
A € D, 
BeF, 
C o E. 


Similarly, the second triangle can be matched with the third by the correspondence, 
D — I, 
E eG, 
F e H. 


Notice that there is no particular point in giving names to these functions, or in 
using functional notation to describe them; there are only three elements in each 
of the two sets, and we can therefore describe the function quite conveniently, 
simply by writing down all three of the matching pairs. There is a shorthand 
which is even briefer. We can describe our first, correspondence in one line, like 
this: 

ABC e DFE. 


Here it should be understood that the first letter on the left is matched with the 
first on the right, the second with the second, and the third with the third, like this: 


Armee DPF "E 


Fieure 6.2 


Given a correspondence between the vertices of two triangles, there is a naturally 
induced correspondence between the sides and the angles. Thus, given the cor- 
respondence 

: ABC + DFE, 


the induced correspondence between the sides is 
AB o DF, 
BC + FE, 
AC + DE, 
and the induced correspondence between the angles is 


ZA e ZD, 
ZB e ZF, 
ZC o ZE. 


(Here, as usual, we are using the shorthand ZA for ZBAC, ZB for Z ABC, 


— 
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and so on.) When a correspondence between the vertices is given, then whenever 
we speak of corresponding sides or corresponding angles, we shall always be referring 
to the correspondence induced in the way shown above. 

Of course, not every one-to-one correspondence between the vertices of two tri- 
angles describes a workable scheme for moving the first triangle onto the second, 
even if the triangles happen to be congruent. For example, the corresponderice 


ABC — FED 


is unworkable; the triangles can't be made to fit in this way. To test whether a 
correspondence between the vertices is ^workable," we need to check whether the 
matching sides and angles are congruent. In fact, this is our official definition of 
a congruence. 


Derinivion. Given A ABC, ADEF, and a one-to-one correspondence 


ABC + DEF 


between their vertices. If every pair of corresponding sides are congruent, and 
every pair of corresponding angles are congruent, then the correspondence is a 


congruence. 
That is, the correspondence 
ABC — DEF 


is a congruence if all six of the following conditions hold: 


DE; ALA eu Dj 
DF, ZB ZE, 
BC = EF, Zoe zF. 


If ABC «+ DEF is a congruence, then we write 


AABC = ADEF. 
B f E 


A cD F 


Figure 6.3 


Two triangles are called congruent if there is some correspondence between their 
vertices which satisfies the six conditions for a congruence. Note that the expres- 
sion AABC & ADEF says not merely that A ABC and ADEF are congruent, 
but also that they are congruent in a particular way, that is, under the correspond- 
ence ABC €» DEF. 

Therefore, if we want to say merely that two triangles are congruent, we have to 
say this in words; we can't use the shorthand. It turns out, however, that this is 


a 
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no handicap to us, because in the geometry of triangles, the idea of congruence in 
the abstract hardly ever occurs. Nearly always, when we talk about congruent 
triangles, we go on to draw conclusions about “corresponding sides” or “correspond- 
ing angles”; and this means that what we really had in mind, all along, was a cor- 
respondence, whether or not we chose to mention one. The basic idea here is not 
the idea of congruence, but the idea of a congruence. 

It is fairly obvious that if you want to check that a correspondence is a con- 
gruence, you don’t have to check all six pairs of corresponding parts. For example, 
suppose that two sides and the included angle of the first triangle are congruent 
to the corresponding parts of the second, as the markings in the following figure 


indicate. E 


A [55 D F 
ABC © DEF; AB=DE, ZA 4D, AC=DF. 
FIGURE 6.4 


It ought to follow that ABC +> DEF is a congruence. In fact, this is our basic 
congruence postulate. 


SAS. Given a correspondence between two triangles (or between a triangle 
and itself). If two sides and the included angle of the first triangle are con- 


gruent to the corresponding parts of the second triangle, then the correspondence 
is a congruence. 


Here SAS stands for Side Angle Side. We shall refer to this postulate, hereafter, 
as the SAS postulate, or, for short, simply as SAS. 


PROBLEM Set 6.1 


These problems are not stated, and are not supposed to be solved, in terms of deductive 


geometry. You may take for granted that correspondences that look like congruences 
really are congruences. 


1. Write down all of the congruences between an equilateral triangle and itself. 
2. The figure to the right is a five-pointed star. Write 
down all of the congruences between the star and itself. B 
Let us agree that a congruence is simply a matching scheme 
that “works,” and that such a congruence is sufficiently 
described if we explain, in the one-line short notation, A Cc 
where the points A, B, C, D, and E of the star are supposed 
to go. Thus one of the congruences that we are looking for 


is ABCDE + CDEAB. 
3. Given a triangle A ABC which is isosceles but not 
equilateral. That is, AB = AC, but AB x BC. How 3 : 


many congruences are there, between A ABC and itself? Figure 6.5 
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6.2 THE BASIC CONGRUENCE THEOREMS 


An isosceles triangle is a triangle two of whose sides are congruent. A triangle 
which is not isosceles is called scalene. If all three sides are congruent, then the 
triangle is equilateral. The first and easiest consequence of the SAS postulate 
follows. 


Theorem 1. If two sides of a triangle are congruent, then the angles opposite 


them are congruent. 
A 


/ 5 ÉN 


B c 
FicvnE 6.6 


That is, the base angles of an isosceles triangle are congruent. Here, by the base 
angles we mean, of course, the angles opposite the two congruent sides. 

The marking of the figure gives a complete picture of the theorem. The marks 
on the sides AB and AC indicate that these sides are congruent by hypothesis. 
The marks for ZB and ZC, with exclamation points, indicate the conclusion 
that these angles are congruent. Throughout this book, exclamation points will 
be used in figures in this way, to indicate conclusions. 


ResraTemenr. Given A ABC. 1$ AB = AC, then ZB = ZC. 


Proof. Consider the correspondence 
ABC = ACB. 


Under this correspondence, AB o AG, AC o AB, and ZA © ZA. Thus two 
sides and the included angle are congruent to the parts that correspond to them. 
Therefore, by SAS, the correspondence is a congruence: AABC = AACB. By 
définition of a congruence, this means that ZB = ZC, which was to be proved. 

‘This is the famous pons asinorum theorem. The phrase pons asinorum means 
asses’ bridge, and was suggested by the figure which accompanied Euclid’s proof. 


Ficure 6.7 
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Euclid’s proof was rather long; it takes over a page in print. The proof given in 
this book is due, essentially, to Pappus, although Pappus naturally did not use the 
sort of formulation for the congruence postulates that we have been using here. 
Not many years ago—or so the story goes—an electronic computing machine was 
programmed to look for proofs of elementary geometric theorems. When the 
pons asinorum theorem was fed into the machine, it promptly printed Pappus’s 
proof on the tape. This is said to have been a surprise to the people who had coded 
the problem; Pappus’ proof was new to them. What had happened, of course, 
was that the SAS postulate had been coded in some such form as follows. 

“Tf (1) A, B, and C are noncollinear, (2) D, E, and F are noncollinear; (3) 
AB = DE; (4) BC = EF; and (5) ZABC = Z DEF, then (6) AC = DF; 
(7) ZACB = ZDFE; and (8) ZBAC = ZEDF." 


FIGURE 6.8 


This is the sort of austere language in which people commonly talk to vacuum tubes; 
you can't indoctrinate them with vague preconceptions and prejudices; ard so, 
if you want the vaeuum tubes to get the idea that the triangles (Fig. 6.8) in the 
SAS postulate are supposed to be different, you have to say so explicitly. It didn't 
oceur to anybody to do this, and so the machine proceeded, in its simple-minded 
way, to produce the simplest and most elegant proof. 


Corollary 1-1. Every equilateral triangle is equiangular. That is, in an ecui- 
lateral triangle, all three angles are congruent. Proof? 


Theorem 2. The ASA Theorem. Given a correspondence between two triangles 
(or between a triangle and itself). If two angles and the included side of the 
first triangle are congruent to the corresponding parts of the second, then th 
correspondence is à congruence. 


Restatement. Given A ABC, ADEF, and a correspondence ABC €» DEF. 
If ZA SZD, ZC = ZF, and AC = DF, then AABC = ADEF. 


A Cc D F 


FicvnE 6.9 
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Proof. (1) By Theorem C-2 of Section 3.6 there is a point B' on the ray DÉ, 
such that DB’ c AB. 

(2) By SAS, we have AABC = ADB'F. 

(3) By definition of a congruence, we have Z DFB' = ZACB. 

(4) By Theorem 2, Section 5.1, it follows that FB’ = FE. 

(5) Therefore B’ = E, because the lines DE and FE intersect in only one point. 

(6) Therefore, by (3), we have AABC œ= ADEF, which was to be proved. 

From this we get a corollary which is a converse of Theorem 1. 


Corollary 2-1. If two angles of a triangle are congruent, then the sides opposite 
them are congruent. 


The following corollary is a converse of Corollary 1-1. 
Corollary 2-2. Every equiangular triangle is equilateral. 
Proof? 


The third of the basic congruence theorems is somewhat harder to prove. 


Theorem 3. The SSS Theorem. Given & correspondence between two triangles 
(or between a triangle and itself). If all three pairs of corresponding sides are 
congruent, then the correspondence is a congruence. 
Restatement. Given AABC, ADEF, and a correspondence ABC €» DEF. 
If AB = DE, BC = EF, and AG = DF, then the correspondence is a con- 
gruence. 

B E 


Ficure 6.10 


Proof. Before proceeding with details, let us explain what the idea of the proof 
is going to be. First we are going to copy ADEF on the under side of AABC; 
that is, we are going to set up A ABC, with B’ on the opposite side of AC from B, 
so that AAB'C = ADEF. Second, we shall show that AABC = AAB'C.. It 
will follow that ABC = ADEF, which was to be proved. In full, the proof is 
as follows. 
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(1) By the angle-construction theorem (Theorem 2, Section 5.1) there is a ray 
AQ, with Q on the opposite side of AC from B, such that 


ZCAQ = ZEDF. 


(2) By the segment-construction theorem (Theorem C-2 of Section 3.6), there 
is a point B' of AQ such that B 
AB c DE. 


(3) Since we already know that AC œ DF, it follows by SAS that 
AAB'C = ADEF. 


(Thus we have completed the first part of our program.) 
m B'B intersects AC in a point G. (Because B and B' are on opposite sides of 
J) 

The proof now splits up into a number of cases. (i) A-G-C, as in the figure 
(Fig. 6.10). (ii) A = G. (iii) G-A-C. Strictly speaking, there are two more cases 
G = C and A-C-G, but these are essentially the same as (ii) and (iii). 

We proceed with the proof for Case (i). 

(5) ZABG = ZAB'G (by the isosceles triangle theorem), 

(6) ZCBG = ZCB'G (for the same reason). 


(7) G is in the interior of Z ABC. [Since A-G-C in Case (i), this follows from 
Theorem 4, Section 4.2.] 


(8) G is in the interior of Z AB'C. (For the same reason.) 

(9) By (5), (6), (7), and (8), together with Theorem 3, Section 5.1, it follows that 
ZABC = Z AB'C. 

(10) By SAS, it follows that 
AABC S AAB'Q. 

(11) Therefore, by (10) and (3), 


AABC = ADEF. 
(Proof?) 


For Cases (ii) and (iii), the figures look like this: 


& 
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Figure 6.11 
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| We leave these cases as problems. . 

| Roughly speaking, a bisector of an angle is a ray in the interior which splits the 
angle into two congruent parts like this. 

| B 


i 


| D 


c 
FicunE 6.12 


That is, AD bisects ZBAC if (1) D is in the interior of ZBAC, and (2) 
ZBAD = ZDAC. 


Theorem 4. Every angle has exactly one bisector. 


Proof. Given ZBAC. We lose no generality by supposing that AB = AC. 
We can always pick two points on different sides of the angle, equidistant from the 
vertex. 


Ficure 6.13 


Let D be the midpoint of BC. Then D is in the interior of ZBAC. And by the 
SSS theorem, AADC œ= AADB. Therefore ZBAD = ZCAD; and thus AD 
bisects ZBAC. 

Thus we have shown that every angle has at least one bisector. This is half of 
our theorem. We need next to show that ZBAC has at most one bisector. To 
do this, it will be sufficient to show that every bisector of Z BAC passes through 
the midpoint D of BC. 

Suppose that AB bisects ZBAC. Then automatically E is in the » interior of 
ZBAC. By Theorem 4, Section 4.3, it follows that AË intersects BC in some 
point D’, between B and C. By the SAS postulate we have AAD'B = AADC. 
Therefore D'B & D'C, and D' is the midpoint of BC. Since BC has only one mid- 
point, it follows that Z BAC has only one bisector, which was to be proved. 


E 
Figure 6.14 
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PnRoBLEM Ser 6.2 


1. Complete the proof of the SSS theorem. 
*2. Let the “special SAS postulate” be the statement that. you get if you add to the SAS 
postulate the condition that the triangles must be different. Show that the general form 
of the SAS postulate can be proved as a theorem, on the basis of the special SAS postulate. 


6.3 SOME REMARKS ON TERMINOLOGY 


The language in which we have been discussing congruence in this book is 
rather different from the usual language; it may be worthwhile to discuss the reasons 
for the changes. 

We have already explained the reason for speaking about congruences between 
triangles, in the sense of correspondences having certain properties, rather than 
speaking of the relation of congruence in the abstract. The reason, briefly, is that 
the former is what we really mean and what we really need. 

Our use of the word congruent in connection with segments and angles is a 
slightly different matter. Suppose that we have a pair of segments, a pair of angles, 
and a pair of circles which “match up” in the sense suggested by the following 


3 We. 


Figure 6.15 


The situation can then be described in two ways. (1) We can say (following 
widespread usage) that the segments are equal; and similarly for the angles and the 
circles. (2) We can say (following the usage of the present book) that in each of 
the three cases, the figures are congruent. 

The same people who call two segments equal if they have the same length also 
say that two triangles are equal if they have the same area. 

There are two difficulties with the loose use of the word equals to describe equality 
of length, angular measure, and area. The first difficulty is that if the word equals 
is used in this way, there is no word left in the language with which we can say 
that A is—without ifs, buts, qualification, or fudging—the same as B. This latter 
relation is called the logical identity. It may seem a peculiar idea, at first, because 
if two things are exactly the same, there can’t be two of them. But as soon as mathe- 
matics had begun to make heavy use of symbolism, the logical identity became 
important. For example, each of the expressions 


1 2 /3 + 1 
2/3 ar 11 
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describes a number. The descriptions are obviously different, but it is rather easy 
to check that both of the above expressions describe the same number; and this 
is what we mean when we write 


1 Be ee 
On/ Se a d 


The relation denoted by the symbol =, in the above equation, is the logical identity. 

The concept of the logical identity A = Bisso important, and comes up so often, 
that it is entitled to have a word to itself. For this good reason, in nearly all 
modern mathematics, the word equals and the symbol = are used in only one sense: 
they mean is exactly the same as. 

The second difficulty with the loose use of the word equals is that it puts us in 
the position of using two words to describe the same idea, when one word would 
easily do. Congruence is the basic equivalence relation in geometry. We may 
use different technical definitions for it, in connection with different types of 
figure, but the underlying idea is always the same: two figures are congruent if 
one can be placed on the other by a rigid motion. The basic equivalence relation 
of geometry is entitled to have a word all to itself; and the word congruence appears 
to be elected. 

It was, no doubt, for these reasons that Hilbert adopted, in his Foundations of 
Geometry, the terminology that we are using in the present book. The problem is 
not logical but expository. A good terminology matches up the words with the 
ideas in the simplest possible way, so that the basic words are in one-to-one cor- 
respondence with the basic ideas. 

It should be borne in mind that the strict mathematical interpretation of the 
word equals, in the sense of “is exactly the same as,” is a technical usage. In 
ordinary literary English, the word is used even more loosely than in Euclid. For 
example, when Thomas Jefferson wrote, in the Declaration of Independence, that 
all men are created equal, he obviously did not mean that there is only one man in 
the world, or that all men are congruent replicas of one another. He meant merely 
that all men have a certain property in common, namely, the property of being 
endowed by their Creator with certain unalienable rights. 

In fact, only mathematics and logic need a word and a symbol for the idea of 
“is the same as,” and the need did not appear even in mathematics and logic until 
the heavy use of symbolism developed. This development came along after Euclid; 
and sheer force of habit preserved Euclid's terminology long past the time when 
it had become awkward in the context of modern mathematics. 


6.4 THE INDEPENDENCE OF THE SAS POSTULATE 


We have seen that the ASA theorem and the SSS theorem can be proved on the 
basis of the SAS postulate. The question may arise whether the SAS postulate 
can be turned into a theorem; that is, proved on the basis of the postulates that 
precede it. 
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Certain general considerations suggest the contrary. If you reconsider the ruler 
postulate, you will see that it deals with lines one at a time. It does not seem to 
claim that there is any connection between distances measured along one line and 
distances measured along another. The first postulate that describes such a connec- 
tion is SAS; SAS tells us, among other things, that if AB c DE, ZA = ZD 
and AC & DF, then BC & EF. 


B E 
4 : : z 
A CD F 


Ficure 6.16 


Here, except in very special cases, the distances BC and EF are measured along 
different lines. 

We observe, moreover, that the postulates for the angular-measure function m 
do not mention distance at all. 

These considerations strongly suggest that SAS gives genuinely new information. 

Given a set of postulates, say, Py, Po,... , Pn, for a mathematical structure. 
We say that P, is independent of the other postulates P1, P5, ..., P, , if there is 
a mathematical system which satisfies P;, Ps »+++,Pn_y but does not satisfy 
Pa. For example, the postulate which says that every a ~ 0 has a reciprocal 
a is independent of the other field postulates. The easiest way to see this is to 
observe that the integers satisfy all of the field postulates, with the sole exception 
of this one. 

We shall give an example of the same sort to show that SAS is independent of 
the preceding postulates of metric geometry. 

Consider a structure [S, 6 £, d, m] satisfying all of the postulates of metric 
geometry. We may think of this system as coordinate three-dimensional space, 
with the usual definitions of distance and angular measure. We shall define a new 
distance function d’, by making a slight change in the “normal” distance function 
d. We do this in the following way. We choose a particular line L at random. We 
agree that d'(P, Q) is to be the old d(P, Q), except when P and Q both lie on L, 
in which case d'(P,Q) = 2d(P,Q). It is plain that d’ satisfies the ruler postulate 


R P te Tr 


P era. R' 
Fiure 6.17 


EXISTENCE OF PERPENDICULARS 93 


on all lines except perhaps for L. And on L, the ruler postulate also holds. Given 
a coordinate system f which works for the old d, we merely set f'(P) = 2f(P), and 
the new coordinate system now works for d'. 

The SAS postulate now fails for the old m and the new d’ (Fig. 6.17). We ought 
to have APQR = AP'Q'R', because d'(R, P) = d'(R', P), ZR = ZR’ and 
d'(R,Q) = d'(R',Q'). But the congruence between the triangles does not hold, 
because d'(P, Q) = 2d'(P', Q’). 

It is possible also to show the independence of SAS by leaving d unchanged and 
defining a peculiar angular measure m' for angles with a certain point P as vertex. 
But examples of the latter sort are considerably harder to describe and to check. 


6.5 EXISTENCE OF PERPENDICULARS 


You may have noted that in the proof of the SSS theorem in Case (i), we had 
GB L AG. The reason is as follows. By SAS, we have AAGB œ AAGB'. There- 
fore Z AGB œ Z AGB'. Since these two angles form a linear pair, they are sup- 
plementary. Thus Z AGB is congruent to a supplement of itself, and hence is a 
right angle. Therefore our proof of the SSS theorem included, implicitly, a proof 
that “a perpendicular can always be drawn, to a given line, through a given ex- 
ternal point.” We make this explicit in the following theorem. 


Theorem 1. Given a line and a point not on the line, then there is a line which 
passes through the given point and is perpendicular to the given line. 
Proof. Let L be the line, and let B be a point not on L. Let A and C be any two 


points of L. By the angle-construction theorem, there is a point Q such that 
(1) B and Q are on opposite sides of L, and (2) ZBAC = ZQAC. 


Figure 6.18 


By the segment-construction theorem, there is a point B’ of AQ such that AB = 
AB’. Now there are two possibilities: 

(1) G s» A. In this case it follows by SAS that AAGB 2 AAGB'. Therefore 
Z AGB œ Z AGB'. Since these two angles form a linear pair, they are supple- 
mentary. Therefore each of them is a right angle. Therefore BG L AG, as desired. 

(2) G = A. In this case ZBGC œ= ZBAC and ZB'GC = ZQAC. Therefore 


Z.BGC = Z B'GC; and it follows, as in Case (1), that BG 1 AC. 
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Notice that Case (2) is “unlikely,” because A and C were chosen at random on 
the line. But it is possible that we happened to pick the foot of the perpendicular. 

Notice also that if you use the full force of the angle-construction postulate 
(M-3), then you can prove easily that given a line L in a plane E, and a point A 
which is on the line, there is always a line in E which contains A and is perpendicular 
to E. So far, however, we have been making it a point not to use the “protractor 
postulates,” but to use instead the first few theorems based on them; and so we 
do not yet state the above theorem officially. 


CHAPTER 7 


GEOMETRIC INEQUALITIES 


Up to now, in our study of the geometry of the triangle, we have been dealing 
only with congruence; our theorems have stated that under certain conditions we 
can infer that two segments (or two angles) are congruent. We shall now investi- 
gate conditions under which we can say that one segment is larger than another, 
or that one angle is larger than another. 

Initially, we defined inequalities between angles by means of measure. That 
is, ZABC < ZDEF if mZ ABC < mZDEF. It is clear, of course, that the 
same idea can be described simply in terms of congruence, without any regard to 
the source of our concept of congruence. We can say that ZABC < ZDEF if 
there is a point G, in the interior of Z DEF such that ZABC = ZGEF. 


bal: ja 
G 
B G E F 


Figure 7.1 


Similarly, for segments, we may say that AB is shorter than CD (AB < CD) 
if the distance AB is less than the distance CD. Or we may forget where the idea 
of congruence came from, and say that if there is a point E, between C and D, 
such that AB œ CE, then AB < CD. 


A B € E D 
Figure 7.2 


We now proceed to investigate the geometric inequalities associated with a fixed 
triangle. 
In the figure below, the angle BCD is called an exterior angle of A ABC. More 
precisely, if A-C-D, then ZBCD is an exterior angle of AABC. 
B 


A Gap 


FIGURE 7.3 
95 
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Every triangle has six exterior angles, as indicated in the following figure, and 
these six angles form three vertical pairs. 


B 


FIGURE 7.4 


It follows, of course, that the two exterior angles at a given vertex are always con- 
gruent. The angles ZA and ZB of AABC are called the remote interior angles 
of the exterior angles with vertex at C; similarly, ZA and ZC are the remote 
interior angles of the exterior angles with vertex at B; and so also for the third case. 


Theorem 1. Any exterior angle of a triangle is greater than each of its remote 
interior angles. 


RESTATEMENT. Given AABC. If A-C-D, then ZBCD > ZB. 


First we observe that the restatement really does convey the entire content of 


the theorem. ie 


D 
A E 


^. 


Dœ 
FIGURE 7.5 


If we prove that the restatement holds true, then we will also know that the 
other exterior angle at C is greater than Z B, because the two exterior angles at C 
are congruent. It will also follow, merely by a change of notation, that Z ACD’ > 
ZA; and this means that the exterior angles at C are greater than each of their 
remote interior angles. 

We now proceed to the proof of the restatement. 

Let E be the midpoint of BC. 


B F 


A ej D 
FIGURE 7.6 
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By Theorem C-2, there is a point F such that A-E-F and EA = EF. 

Consider AAEB and AFEC. Now EB = EC, by hypothesis for E; EA = EF, 
by hypothesis for F;and ZAEB = ZFEC, because vertical angles are congruent. 
By SAS it follows that the correspondence 


AEB © FEC 


is a congruence; that is, AAEB œ AFEC. Therefore, ZB = ZBCF. 
By Theorem 5, Section 4.2, we know that F is in the interior of ZBCD. There- 


fore, ZBCF < ZBCD; and therefore ZB < ZBCD, which was to be proved. 


Corollary 1-1. The perpendicular to a given line, through a given external point, 
is unique. 
RESTATEMENT. Let L be a line, and let P be a point not on L. Then there is 
only one line through P, perpendicular to L. 
Proof. Suppose that there are two perpendiculars to L through P, intersecting 
Lin points Q and R. We shall show that this is impossible. 
P 


Q R S 


FIGURE 7.7 


Let S be a point of L such that Q-R-S. Then ZPRS is an exterior angle of 
APQR; and ZPQR is one of its remote interior angles. This is impossible, because 
both ZPQR and ZPRS are right angles. (See "Theorem 6 of Section 5.1.) 


Theorem 2. If two sides of a triangle are not con; ruent, then the angles op- 
posite them are not congruent, and the greater angle is opposite the longer side. 


Restatement. Given AABC. If AB > AC, then ZC > ZB. 


Proof. Let D be a point of AC, such ı that AD = AB. Then A-C-D, as the 
figure indicates, because AD z AB » AC. Since the base angles of an isosceles 
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Figure 7.8 
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triangle are congruent, we have 
ZABD = ZD. (1) 


Since A-C-D, it follows by Theorem 4, Section 4.2, that C isin the interior of Z ABD. 
Therefore 


ZABC < ZABD; (2) 
therefore 
ZABC « ZD. (3) 
Since Z ACB is an exterior angle of ABCD, we have 
ZD « ZACB. (4) 
By (3) and (4), 
ZABC < ZACB. 
Thus, in A ABC we have 
NAPAA 


which was to be proved. 


Theorem 3. If two angles of a triangle are not congruent, then the sides opposite 
them are not congruent, and the larger side is opposite the larger angle. 


Restarement. Given AABC. If ZB < ZC, then AC < AB. 
Proof. (1) If AC e AB, then by the isosceles triangle theorem it would follow 
that ZB = ZC; and this is false. 


(2) If AC > AB, then by Theorem 2 it would follow that ZB > ZC; and 
this is false, 


The only remaining possibility is AC « AB, which was to be proved. 


Theorem 4. The shortest segment joining a point to a line is the perpendicular 
segment. 


RESTATEMENT. Let L be a line, let P be a point not on L, let Q be the foot of the 
perpendicular to L through P, and let R be any other point of L. Then PQ « 
PR. 


P 
Sur oO R L 
Figure 7.9 


Proof. Let S be a point of L such that S-Q-R. Then Z PQS is an exterior angle 
of APQR. Therefore ZPQS > ZPRQ. Since PQ L L, we know that ZPQS = 
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Theorem 5. In any triangle, the sum of the lengths of any two sides is greater 
than the length of the third side. 


Restatement. If A, B, and C are noncollinear, then AB + BC > AC. 
Proof. Let D be a point of CB such that C-B-D and BD = BA. Then 
CD = AB + BC. (1) 
Now B is in the interior of Z DAC, by Theorem 4 in Section 4.2; therefore 
ZDAB < ZDAC. (2) 


Since ABAD is isosceles, with BA = BD, it follows that 2D = ZBAD; 
therefore 


ZD « ZDAC. e (8) 


By applying Theorem 3 to AA DC, we get CD > AC; and this may be expressed 
in terms of distance, as follows: : 


„~ 
CD > AC. (4) V 
\ re! 
By (1) and (4), we have \ Er E 
A —.c 
AB 4- BC > AC, 
FicunE 7.10 


which was to be proved. 

Note that this theorem was stated and proved in terms of distance, rather than 
in terms of the relation of congruence between segments. Here we have departed 
from the style of the preceding few chapters. The reasons for this departure will 
be explained in the following section. (The questions involved here are more com- 
plicated than you might think.) 


Theorem 6. If two sides of one triangle are congruent, respectively, to two sides 
of a second triangle, and the included angle of the first triangle is larger than the 
included angle of the second triangle, then the opposite side of the first tri- 
angle is larger than the opposite side of the second triangle. 


Restatement. Given AABC and ADEF. If AB = DE, AC = DF, and 
ZA > ZD, then BC > EF. 
B 
E 


ese 


A Cc D F 
Figure 7.11 
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Proof. (1) We assert that there is a point K, in the interior of ZBAC, such that 


AAKC = ADEF. 
B 


a 
Q 


FIGURE 7.12 


To show this, we first take a point Q, on the same side of AG as B, such that 
ZQAC = ZEDF. (This is by the angle-construction theorem.) Since ZA > 
£D, Q is in the interior of Z BAC; in fact, every point of AQ — A isin the interior 
of ZBAC. Let K be the point of AQ such that AK œ DE. By SAS, we have 
AAKC = ADEF, which is what we wanted. 

(2) Next, let AR be the bisector of ZBAK. 


R 
B 
M 
L 
A c 
FiaunE 7.13 


(See Theorem 4 of Section 6.2, which asserts that every angle has exactly one 
bisector.) By the cross-bar theorem, AK intersects BC in a point L; and by 


another application of the eross-bar theorem, AR intersects BL in a point M. 
(3) By SAS, we have 
AABM = AAKM. 


'Therefore MB — MK. By Theorem 5, we know that 


CK « CM + MK. 
Therefore 


CK < CM + MB, 


because MB = MK. Now CM + MB = CB, because C-M-B. And CK = EF, 
because AAKC œ= ADEF. Therefore, we finally get 


EF « CB, 
which was what we wanted. 
The above proof is correct and complete as it stands, but it works for reasons 
that are a little trickier than one might suspect. The last of the figures given does 
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A Cc A c 


Figure 7.14 


not indicate all of the possibilities. The figures might look like either of those 
above. The proof, that we have given applies word for word, to the figure on the 
left, and for thé figure on the right we merely need to give a different reason for 
the inequality CK < CM + MK. 

Finally, we observe that the SAA theorem can be proved merely on the basis of 
the exterior angle theorem, without the use of the parallel postulate or theorems 
based on it. 


Theorem 7. The SAA Theorem. Given a correspondence between two tri- 
angles. If two angles and a side of the first triangle are congruent to the cor- 
responding parts of the second, then the correspondence is a congruence. 


Restatement. Given AABC, ADEF, and ABC €? DEF. If AB S DE, 
ZA & ŻD, and ZC = ZF, then AABC = ADEF. 


B 


Figure 7.15 


(Note that the case where the given side is included between the two angles 
has already been taken care of by the ASA theorem.) 
Proof. Let F’ be a point of DF, such that DF” = AC. By SAS we have 


AABC.= ADEF”. 


Therefore ZF’ = LC = ZF. But we must have (1) D-F-F', (2) D-F'-F or 
(3) F = F'. If D-F-F', then ZF is an exterior angle of AEFF", so that ZF > 
ZF’, which is false. If D-F'-F, then ZF” is an exterior angle of AEFF', and 
ZF’ > ZF, which is also false. Therefore F' = F, and AABC = ADEF, 


which was to be proved. 


CHAPTER 8 


THE EUCLIDEAN PROGRAM: 
CONGRUENCE WITHOUT DISTANCE 


| ee eS... ———— 


8.1 THE SYNTHETIC POSTULATES 


In our treatment of geometry so far in this book, the real numbers have played 
a central role. You will recall that the basic structure is 
(S, £, &, d, m), 
where d and m are real-valued functions, defined for point pairs and angles, re- 
spectively. The fundamental ideas of congruence for segments, betweenness for 
points on a line, and congruence for angles were defined in terms of distance and 
angular measure in the following way. 


(1) A-B-C means (by definition) that A, B, and C are different points of the 
same line, and 
AB + BC = AC 


(where PQ is the distance d(P, Q) between P and Q). 
(2) AB is defined as the union of A, B, and all points between A and B. 
(3) AB = CD means (by definition) that AB = CD. 
(4) ZA £& ZB means (by definition) that mZ A = mZ B. 


Under this scheme, nearly all the basic properties of betweenness and con- 
gruence for segments and angles could be proved as theorems; the only exception 
was the SAS postulate. 

This scheme of presentation for geometry is not the classical one. It was proposed 
in the early 1930’s by G. D. Birkhoff, and has only recently become popular. The 
classical scheme, as one finds it in Euclid or in David Hilbert’s Foundations of 
Geometry, is quite different; the basic difference is that in Euclid the real numbers 
do not appear at all at the beginning, and appear only in a disguised form even 
at the end (see Chapter 20). Euclid’s treatment of geometry is called the synthetic 
treatment. The Birkhoff scheme is called metric geometry, because it uses measure- 
ment. 

We now give a sketch of how the Euclid-Hilbert treatment works. At the 
beginning, of course, we have the system 


IS, £, 9]. 
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(If we weren’t going to talk about points, lines, and planes, we could hardly be 


said to be talking about geometry.) The treatment of incidence theorems is exactly 
like the one given in Chapter 2. Immediately thereafter, however, a basic difference 
appears. Instead of adding to our structure the real-valued functions d and m, 


we add the following things. 


(1) An undefined relation of betweenness, for triplets of points. This is regarded 
as given, in the same way that d and m were regarded as given, subject to certain 


postulates to be stated soon. 
Segments are defined, as before, in terms of betweenness. We then add to our 


structure the following: 
(2) An undefined relation of congruence for segments, and 
(3) An undefined relation, also called congruence, for angles. 


It does no harm to denote our two undefined congruence relations by the same 
symbol, =. If we represent our betweenness relation as ®, then the structure 
becomes , 

(S, £, 9, &, =I. 


It is now out of the question to prove theorems about betweenness and con- 
gruence until we have enough postulates to describe their basie properties. These 
are in three groups, as follows. 


Betweenness Postulates 
B-1. If A-B-C, then C-B-A. 
B-2. Of any three points of a line, exactly one is between the other two. 
B-3. Any four points of a line can be named in an order A, B, C, D, in such a 
way that A-B-C-D. 
B-4. If A and B are any two points, then (1) there is a point C such that A-B-C, 
and (2) there is a point D such that A-D-B. 
B-5. If A-B-C, then A, B, and C are three different points of the same line. 


These statements are, of course, the same as the theorems that were proved 
on the basis of the ruler postulate in Section 3.4, where betweenness was defined 


in terms of distance. 


Congruence Postulates for Segments 
C-1. For segments, congruence is an equivalence relation. 
C-2. The Segment-Construction Postulate. Given a segment AB and a ray CD. 
There is exactly one point E of CD such that AB = CE. 
C-3. The Segment-Addition Postulate. If (1) A-B-C, (2) A'-B'-C', (3) AB = 
WB’, and (4) BC = B'C', then (5) AC = AC. 
C4. If (1) A-B-C, (2) A-B'C', (8) AB = WB, and (4) AC = AC, then 
(5) BC = BC’. 


104 CONGRUENCE WITHOUT DISTANCE 


C-5. Every segment has exactly one midpoint; that is, for every AB there is 
exactly one point C such that A-C-B and AC & CB. 


These statements are, of course, the theorems that were proved in Section 3.6, 
where congruence for segments was defined in terms of distance. 

The separation postulates and the treatment of sides of a line, interiors of angles, 
and so on, are exactly as in Chapter 4. 


Congruence Postulates for Angles 
C-6. For angles, congruence is an equivalence relation. 
C7. Let ABC be an angle, let B'C be a ray, and let H be a half plane whose 
M — —— 

edge contains B’C’. Then there is exactly one ray B’A’, with A’ in H, such 

that ZABC > Z A'B'c'. 

C-8. If (1) D is in the interior of ZBAC, (2) D' is in the interior of Z.B'A'C', 

(3 ZBAD S Z B' A'D', and (4) ZDAC = Z D'A'C', then (5) ZBAC c 

Z B' A'C". : 

C-9. If (1) D is in the interior of ZBAC, (2) D' is in the interior of Z.B/4 'C", 

(3) ZBAD = Z B'A'D', and (4) ZBAC = Z B'A'C', then (5 ZDAC = 

ZAC, 

These statements are, of course, the statements that were proved in Section 5.1, 
where congruence for angles was defined in terms of the angular-measure func- 
tion m. i 

Finally, we state the SAS postulate exactly as before. 

Let us now consider the question of how much of our work in the preceding 
sections needs to be done over again if we decide, for some reason, that it would 
be good to get along without using the real-valued functions d and m. The answer 
is that practically none of it needs to be done over again, because until we got to the 
preceding chapter, on geometric inequalities, we almost never cited the definitions of 
betweenness and congruence except at the very beginning in each case, when we were 
proving the basic theorems that we now propose to take as postulates. The next time 


to a line L, through a point P, in a given plane. We showed (Theorem 1, Section 
6.5) that when P is not on L, such a perpendicular always exists. We still need 
to prove, on the basis of the postulates in this chapter, that when P lies on L, 
the same conclusion follows; and we still need to prove, on the same postulational 
basis, that all right angles are congruent, 

In the following section, we complete the treatment of congruence by proving 
these two theorems. In the next section, we show how the theory of geometric 
inequalities can be handled on the basis of purely synthetic postulates. 
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p 8.2 RIGHT ANGLES, SYNTHETICALLY CONSIDERED 


The first step in handling right angles is to prove the following theorem, which 
is closely related to C-8. 


Theorem 1. If (1) ZBAC and ZCAD form a linear pair, (2) ZB'A'C' and 
ZC’ A'D' form a linear pair, and (3) ZCAD = ZC'A'D', then (4) ZBAC = 
BAC, 


Figure 8.1 


Proof. Evidently there is no harm in assuming that 
ABZ AB, ACZ AC, ADZAD, 


since all six of our rays are equally well described if the points B’, C’, D' are chosen 
in this way. By SAS, we have AADC = AA'D'C'. Therefore ZADC = 
ZA'D'C'; therefore ZBDC = ZB’D’C’. By SAS, this means that ABDC = 
AB'D'C’. Therefore BC & B'O’. By the SSS theorem then, ABAC = AB'A'C'. 
Therefore ZBAC œ= ZB’A’C’, which was to be proved. 


Theorem 2. Any angle congruent to a right angle is also a right angle. 


Restatement. Suppose that (1) ZBAC and ZCAD form a linear pair, and 
(2) ZBAC = ZCAD. Suppose that (3) 2B’A’C’ and ZC'A'D' form a linear 
pair, and (4) ZB'A'C' = ZBAC. Then (5) ZB'A'C' = ZC'A'D'. 


c [od 


Ficure 8.2 


You should analyze this restatement carefully to see that it really is a restate- 
ment of Theorem 2. Once we have formulated the theorem in this way, the proof 
is trivial, By the preceding theorem, ZC'A'D' = ZCAD. By (2), ZCAD S 
ZBAC. Therefore 

ZC' A'D' = ZBAC = ZB'A'C', and ZC' A'D' = ZB'A'C', 


which was to be proved. 
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Theorem 3. All right angles are congruent. 


RESTATEMENT. Suppose that ZBAC and ZCAD form a linear pair and are 
congruent. Suppose that Z.B'A'C' and ZC' A'D' form a linear pair and are 
congruent. Then Z BAC = Z B' A'(v, 


Figure 8.3 


Proof. First we observe that the points B’, and D’ can be chosen so that 
AB’ = AD, 
as indicated in the figure. It follows, by SAS, that A A'B'C' e A A'D'C', so that 
we have also ZB e Zp’ 


Let A'É be a ray, with E on the same side of À/D' as C’, such that 
ZB'A'E = ZBAC. 


We need to prove that AE = A0’; it will then follow that ZBAC c Z B'A'C', 
It A"É = AC, then E lies in the interior of one of the angles, ZB'A'C', 
ZC'A'D'. Suppose that E is in the interior of Z B' A'C', as shown in Fig. 8.3. 
Then A’E intersects BC’ in a point F. Let G be a point between C" and D', such 
that D'G = BF. By SAS we have 
AB'A'F = AD'A'G. 
Therefore 
ZB'A'F = ZD'A'G. 
But by Theorem 1, we know that Z D'A'E = ZCAD. Therefore Z D'A'E c 
ZCAD = ZBAC £& ZB'A'E c Z D'A'G. 
We now have the following situation: 
(1) G and E are both on that side of 170 which contains C", 
(2) ZD'A'E = Z D'A'G. 
(3) FË and AG are different, because E and G are on opposite sides of ÅO 
(the sides that contain B’ and D’, respectively). 
This is impossible, by C-7, because that postulate says that conditions (1) and 
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are congruent, and Hilbert showed that the postulate was unnecessary.) Hilbert’s 
postulates, in Foundations, were considerably weaker than those given in Section 
8.1. In this book, our purpose is not to give the weakest postulates that can be 
made to work, but merely to give a valid and workable scheme. 


Theorem 4. Given a plane Æ, a line L, and a point P of E. There is exactly one 
line in E which contains P and is perpendicular to L. 


Proof. We already know that this is true for the case in which P is not on L. 
We also know, by Theorem 1, Section 6.5, that some angles are right angles. Let 
Q be any point of L other than P. By C-7, there is a point R on one side of L in 
E, such that ZQPR is congruent to a right angle. By Theorem 2, this means that 
ZQPR is & right angle, therefore PR LL. If there were two such rays PR, 
PR’, then we would have ZQPR œ= ZQPR', because all right angles are con- 
gruent. This is impossible, by C-7. 


E42 
FIGURE 8.4 
Note that in a metric treatment we see immediately that ZA is a right angle if 


and only if mZA = 90. Thus, in a metric treatment all the theorems in this 
section are almost too trivial to be worthy of explicit statement. 


8.3 GEOMETRIC INEQUALITIES, SYNTHETICALLY CONSIDERED 


At the start we have to explain, without mentioning distances, what it means to 
say that one segment is shorter than another or that one angle is smaller than 
another. The appropriate definitions have already been suggested in Chapter 7. 


Derinition. AB < CD if there is a point B’, between C and D, such that 
AB = CB’. 


o—____+_-* ——__+}+_*+* 
A B C B D 
Figure 8.5 


The basic properties of the relation < are 

(I) For every pair of segments AB, CD, exactly one of the following conditions 
holds: Bye) PAGI = 
AB «OD, -AB=CD, CD < AB. 
(Il) If AB < CD and CD < EF, then AB < EF. 
(III) It AB c AB’, CD = CD’ and AB < CD, then AB’ < CD’. 
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Note that (I) and (II) are very much like Conditions O-1 and O-2 for an order 
relation, with = replaced by Œ, throughout. 


Derinition. Given Z ABC and ZDEF. If there is a point G in the interior of 
ZDEF, such that ZABC = ZGEF, then ZABC < ZDEF. 


D 
A 


Ficure 8.6 


The basic properties of this relation are precisely analogous to (I), (II), and 
(III) above. 


(IV) For every pair of angles Z A, ZB, exactly one of the following conditions 
holds: 
ZI e ZB: ZA = ZB, ZB SLA. 


(V) If ZA < ZBand ZB < ZC, then ZA < ZC. 
(VD If ZA & ZA’, ZB = ZB' and ZA < ZB, then ZA’ < ZB’. 


All these statements look plausible enough, but the proofs are not trivial. In 
the following section, we shall give the proofs of (I), (II), and (III), which will 
be sufficient for our present purposes. In the meantime, let us consider how the 
theory of geometric inequalities works, from the purely synthetic viewpoint. 

Adopting the purely synthetic interpretation for inequalities between angles and 
segments, we revisit Chapter 7, and find that everything goes through in essentially 
the same way as before, until we get to Theorem 5, Section 7.1, which asserts that 
in any triangle AABC, we have AB + BC > AC. Notice that our troubles do 
not even postpone their arrival until the start of the proof. Our first problem is to 
state the theorem without talking about distances. If we feel sufficiently desperate, 
we might come up with the following rather clumsy and artificial formulation. 

Theorem 1. Given AABC, there is a point A’ such that A'B & AB, A'-B-C 

and AC > AC. 


A 


Figure 8.7 
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This really does convey the idea, because it says, intuitively speaking, that if 
AB and BC are laid end ‘to end, they form a segment which is longer than AC 
(Fig. 8.7). 

Note that the difficulty in handling this theorem synthetically is that it deals 
with the idea of addition. In metric geometry, the situation is quite simple, be- 
cause the addition is performed with numbers, and addition of numbers is common- 
place. In synthetic geometry, we need to do a lot more talking, if we want to make 
sense, because the “sum” of two segments can be regarded as a segment only when 
the segments are end to end, like this: 

uM HÀ À 
A B C 
FIGURE 8.8 


Here it is reasonable to call AG the “sum” of AB and BC. But if the segments 


look like this: 
A ncn 
A D 
FicunE 8.9 


it is not plain what segment their sum ought to be. — . ue 

The easiest way around this is as follows. Given 7 AB, let [AB] be » the set of all 
segments that are congruent to AB. Obviously, if AB = CD, then [AB] = (CD). 
The sets [AB] are called congruence classes. What we have just pointed out is that 
a congruence class is equally well described by any of its members. 


Suppose now that any two segments AB and CD are given. Then there are 
always points X, Y, Z such that 


x-Yz, xXY2 AB, Y= CD. 


The following observations are easy to check on the basis of the congruence 


postulates. 
(1) If X’, Y', Z' are any other three points satisfying the same conditions, then 
it follows by the segment-addition postulate that X 7Z/ = XZ. That is, the con- 
gruence class [XZ] is independent of the choice of X, Y and Z. n 
(2) Suppose that AB = WB’ and CD = C'D'. Let X, Y, Z be chosen for AB 
and CD, as above; and let X*, Y',Z' be chosen for A'B’ and C'D'. Then XZ = 
XZ. That is, the congruence class [XZ] depends only on the congruence classes 
[AB] _and (CD); it is independent of the choice of representative segments AB 
and CD. 
Addition can now be defined, not between segments, of course, but between 
congruence classes. Given [AB] and (CD), we take X, Y, and Z such that 


x-yz, Ye AB, YZ=CD. 
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Then, by definition, fix m e 
[AB] 4- [CD] — [XZ]. 


Our remarks [(1) and (2)] show that this definition makes sense. The con- 
gruence class [XZ] really is independent of the choice of AB, CD, X, Y, and Z; 
it depends only on the congruence classes [AB] and [CD]. 

Finally, we recall that (III) tells us that if AB « CD, then any segment in 
[AB] is less than every segment in [CD]. We can therefore define 


[AB] < [CD] 


to mean that every segment congruent to AB is less than every segment congruent 
to CD. 

Having built up all of this apparatus, we can finally give a more natural looking 
statement of our triangular inequality theorem, 


Theorem 2. For any triangle A ABC, we have 
[AB] + [BC] > [AC]. 


The metric form given in Chapter 7 seems preferable. Surely it lends itself 
better to elementary exposition. One advantage of the metric scheme is that it 
permits us to speak simply, logically, and intelligibly, all at the same time. 


8.4 PROOFS OF THE LAWS OF INEQUALITY FOR SEGMENTS 


Here we give the proofs of (I), (II), and (III) of the preceding section. 


Theorem 1. If A-B-C and A-C-D, then A-B-C-D. 


Proof. We know that A, B, C, and D can be arranged in an order W, X, Y, Z, - 
so that W-X-Y-Z. Here W cannot be B or C, because W is not between any two of | 
the other three points. For the same reason, Z cannot be B or C. Therefore W 
and Z must be A and D in some order. Since W-X-Y-Z and Z-Y-X-W say the 
same thing, we can assume W = A, and Z = B. This gives 


A-X-Y-D, 


where X and Y are B and C. We cannot have A-C-B-D, because A-B-C. There- | 
fore we have A-B-C-D, which was to be proved. 


Theorem 2. If A-B-C and AC c AC’, then there is a point B’ such that 
A'-B'-C' and AB = A’B’. 


B"? B 
-——TLH ——————5$ —dHÀ——M $9 
A B € a 
Figure 8.10 
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Proof. ‘By the segment-construction postulate, C-2, we know that there is 
exactly one point B’ on the ray AC such that A'B’ œ AB. There are now three 
possibilities: 

(1). - B^ zu 05 (2) A'-C'-B', (3) A'-B'-C'. 
We shall show that both (1) and (2) are impossible. It will follow that (3) holds 
true. 

(1) Suppose that B’ = C'. Then the ray AG contains two points X (namely, 
X = B and X = C) such that AX = AC’. This contradicts the segment- 
construction postulate, C-2. 

(2) Suppose that A'-C'-B'. By the segment-construction postulate there is a 
point D on the ray opposite to CA such that CD = CB’. 


A’ a B 
—L——.———— 
A B C 
Fıcure 8.11 


Thus A-C-D, A'-C'-B', AC = AC’, and CD = C'B'. Therefore, by the segment- 
addition postulate, we have AD = A'B’. Since WB’ = AB, this gives AD c 
.AB, which contradicts the uniqueness condition in the segment-construction 
postulate. 


Theorem 3. If AB < CD, and CD = C'D', then A 
Proof. We have a point B' such that C-B'-D and CB' 


there is a point B" such that C'-B"-D' and C B? = CB’. But AB = CB’. There- 
fore 
AB=CB", ad AB«CD, 


which was to be proved. 


Theorem 4. If AB < CD, and A'B' = AB, then A'B/ < CD. 

This is easy even without the preceding theorems. Proof? Fitting these together, 
we get: 

Theorem 5. If AB = AVE", CD = CD’, and AB < CD, then AB < CD. 
This, of course, is (III) of Section 8.3. 


Theorem 6. For every pair of segments AB, CD, exactly one of the following 
conditions holds: 


AB«CD, ABzCD, CD < AB. 
[This is Condition (I).] 
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Proof. There is a point B’, on CD, such that CB’ = AB. 


Figure 8.12 


If C-B’-D, then (1) AB < CD. If B' = D, then (2) AB & ČD. If C-D-B', 
then by Theorem 2 it follows that there is a point D’, between A and B, such that 
AD' = CD. Therefore (3) CD < AB. 

Conversely, if (1) holds, then C-B’-D, because there is only one point satisfying 
the conditions for B’. For the same reason, if (2) holds, then B’ = D. 

We shall show, finally, that if (3) holds, then C-D-B’. Given that CD < AB, 
we know that there is a D’ such that A-D'-B and AD’ = CD. 


— y e y o pele es N re ay eee eee 
A D' B C D B" 


FIGURE 8.13 


There is a point B", on the ray opposite to De, such that DB" œ D'B. Therefore 
C-D-B". Therefore CB" AB. (Why?) Therefore B" = B’, and C-D-B'. 

Since exactly one of the conditions C-B’-D, B’ = D, C-D-B' holds, it follows 
that exactly one of the equivalent conditions AB < CD, AB = CD, CD < AB 
must hold. 


Theorem 7. If AB < CD and CD < EF, then AB < EF. 
A B 
| 
C———_ 42 4p 
I 1 
I | 
d i i 


B" D' 
FIGURE 8.14 


Proof. Take D' so that E-D'-F and ED' = CD (Fig. 8.14). Take B' so that 
C-B'-D and CB’ £& AB. By Theorem 3, there is a point B" such that E-B"-D'. 
Since E-B"-D' and E-D'-F, it follows by Theorem 1 that E-B"-D'-F, so that 
E-B"-F. Therefore AB < EF, which was to be proved. This is (II) of Section 8.3. 


8.5 A SUMMARY OF THE DIFFERENCES BETWEEN THE METRIC AND SYNTHETIC 
APPROACHES 


We have by now described two fundamentally different approaches to geometry. 
It may be helpful, by way of outline and review, to give a table indicating how these 
two approaches differ (Table 8.1). The basic ideas and statements appear in the 
left-most column, and in the next two columns we indicate how they are treated. 
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TABLE 8-1 
. Metric approach Synthetic approach 
1. The given 
COPA (S, £, 6, d, m] IS, £, 6, 8, 2] 
2. Distance MK I Never mentioned 
3. Measure for Given, in the ‘ 
angles structure Never mentioned 
4, Congruence Defined in terms Given, in the 
B for segments of distance structure 
5. Congruence Defined in terms of Given, in the 
for angles degree measure structure 
6. Properties of Stated in Stated in 
congruence theorems postulates 
7. Additi Performed with Performed with 
p os numbers AB congruence classes [AB] 
Defined between 
Defined between 
8. Inequalities congruence classes, 
numbers, AB. « CD [B] < (CDI 


Henceforth, except in Chapter 20, we shall use the metric structure 
(S, £, 9, d, mJ. 


As we have seen, it has great advantages of simplicity; and these advantages will 
increase as we proceed. 


CHAPTER 9 


THREE GEOMETRIES 


9.1 INTRODUCTION 


This chapter is purely informal; it does not form a part of the deductive sequence 
of the rest of the book. In fact, in this chapter we shall not prove anything at all; 
but everything that we discuss will be taken up more fully later. It may be of some 
help, however, to sketch in advance the kinds of geometry to which our theorems 
are going to apply. 

For the sake of simplicity, we shall limit ourselves to geometry in a plane. 
The ideas that we shall discuss can be generalized to three dimensions, but only 
at the cost of considerable labor. 

Two lines are called parallel if they lie in the same plane but do not intersect. 
In a Euclidean plane, the familiar parallel postulate holds. 


The Euclidean Parallel Postulate. Given a line L and a point P not on L, there 
is one and only one line L’ which contains P and is parallel to L. 


This says that parallels always exist and are always unique. 

For quite a while—for a couple of millennia, in fact—this proposition was 
regarded as a law of nature. In the nineteenth century, however, it was discovered 
by Lobachevski, Bolyai, and Gauss that you could get a perfectly consistent mathe- 


matical theory by starting with a postulate which states that parallels always 
exist, but denies that they are unique. 


The Lobachevskian Parallel Postulate. Given a line L and a point P not on L, 
there are at least two lines L’, L” which contain P and are parallel to L. 


i —————————— L 
Fieure 9.1 


The picture looks implausible, because we are accustomed to thinking of the 
plane of the paper as Euclidean. But it is a fact, as we shall see, that a mathe- 


matical theory can be based on Lobachevski’s postulate. And such a theory actually 
has applications in physics. 


There is yet a third alternative. We can deny not the uniqueness of parallels 
but their existence. 
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The Riemannian Parallel Postulate. No two lines in the same plane are ever 
parallel. 


These postulates give us three kinds of “plane geometry,” the Euclidean, the 
Lobachevskian, and the Riemannian. In each of the three theories, of course, 
many other postulates are needed; we have merely been singling out their crucial 
difference. In this book, we shall be concerned mainly with the first of these geom- 
etries, incidentally with the second, and hardly at all with the third. In the follow- 
ing sections, we give concrete examples, or models, of these kinds of geometry, 
and indicate the most striking differences between them. In going through the 
rest of this book, you should have one of these models in mind most of the time; 


and at some points you should have in mind two of them. 


9.2 THE POINCARE MODEL FOR LOBACHEVSKIAN GEOMETRY 


In this section we shall assume that there is a mathematical system satisfying 
the postulates of Euclidean plane geometry, and we shall use Euclidean geometry 
to describe a mathematical system in which the Euclidean parallel postulate fails, 
but in which the other postulates of Euclidean geometry hold. 

Consider a fixed circle C in a Euclidean plane. We assume, merely for the sake 
of convenience, that the radius of Cis 1. Let E be the interior of C. 


Ficure 9.2 


By an L-circle (L for Lobachevski) we mean à circle C^ which is orthogonal to C. 
When we say that two circles are orthogonal, we mean that their tangents at 
each intersection point are perpendieular. If this happens at one intersection 
point R, then it happens at the other intersection point S. But we shall not stop 
to prove this, or, for that matter, to prove anything else; this chapter is purely 
descriptive and proofs will come later. 

The points of our L-plane will be the points of the interior E of C. By an L-line 
we mean (1) the intersection of E and an L-circle, or (2) the intersection of E and 
a diameter of C. 
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It is a fact that 
l-1. Every two points of E lie on exactly one L-line. 


We are going to define a kind of “plane geometry," in which the “plane” is the 
set E and the lines are the L-lines. In our new geometry we already know what is 
meant by point and line. We need next to define distance and angular measure. 

For each pair of points X, Y, either on C or in the interior of C, let XY be the 
usual Euclidean distance. Notice that if R, S, T and U, are as in the figure, then 
R and S are not points of our L-plane, but they are points of the Euclidean plane 
that we started with. Therefore, all of the distances TS, TR, US, UR are defined, 
and (I) tells us that R and S are determined when T and U are named. There is 
one and only one L-line through T and U, and this L-line cuts the circle C in the 3 
points R and S. We shall use these four distances TS, TR, US, UR to define a 
new distance d(T, U) in our “plane” E, by the following formula: 


ua TR/TS|. 
Evidently we have the following postulate. 
D-0. d is a function 
d:ÉEX E— R. 


Let us now look at the ruler postulate D-4. On any L-line L, take a point U 
and regard this point as fixed. For every point T of L, let 


KT) = log. EDS - 


That is, f(T) is what we get by omitting the absolute value signs in the formula for 
d(T, U). We now have a function, 


f:L—R. 


We shall show that f is a coordinate system for L. 
If V is any other point of L, then 


IV) = log, ES 


UR/US' 
Let z — f(T) and y — f(V). Then 
wr he TR/TS VR/VS TR/TS 
z = |l e rs FFD rral = orere 
le = vl Be UR/US ~ 8e TEOS log. Tp /U8| ' 


because the difference of the logarithms is the logarithm of the quotient. Therefore 
|x > y| on aT, V), 


which means that our new distance function satisfies the ruler postulate. 


Since D-4 holds, the other distance postulates automatieally hold. (See Problem 
1, Section 3.3.) 


kal 
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We define betweenness, segments, rays, and so on, exactly as in Chapter 3. All 
of the theorems of Chapter 3 hold in our new geometry, because the new geometry 
satisfies the postulates on which the proofs of the theorems were based. The same 
is true of Chapter 4; it is rather easy to convince yourself that the plane-separation 
postulate holds in E. 

To discuss congruence of angles, we need to define an angular-measure function. 
Given an “L-angle” in our new geometry, we form an angle in the old geometry 
by using the two tangent rays: 


FIGURE 9.3 


We then define the measure mZBAC of ZBAC to be the measure (in the old 


sense) of the Euclidean angle ZB’ AC’. 
It is a fact that the resulting structure 


(E, L, d, m] 


satisfies all the postulates of Chapters 2 through 6, including the SAS postulate. 
The proof of this takes time, however, and it requires the use of more Euclidean 
geometry that we know so far. Granted that the postulates hold, it follows that 
the theorems also hold. Therefore, the whole theory of congruence, and of geo- 
metric inequalities, applies to the Poincaré model of Lobachevskian geometry. 


FiovnE 9.4 
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On the other hand, the Euclidean parallel postulate obviously does not hold 
for the Poincaré model. Consider, for example, an L-line L which does not pass 
through the center P of C (Fig. 9.4). Through P there are infinitely many L-lines 
which are parallel to L. 

Lobachevskian geometry (also called hyperbolic geometry) is the kind represented 
by the Poincaré model. In such a geometry, when the familiar parallel postulate 
fails, it pulls down a great many familiar theorems with it. A few samples of 
theorems in hyperbolic geometry which are quite different from the analogous 
theorems of Euclidean geometry follow. 


(1) No quadrilateral is a rectangle. In fact, if a quadrilateral has three right 
angles, the fourth angle is always acute. 


(2) For any triangle, the sum of the measures of the angles is always strictly 
less than 180. 


(3) No two triangles are ever similar, except in the case where they are also 
congruent. 


The third of these theorems means that two figures cannot have exactly the same 
shape, unless they also have exactly the same size. Thus, in hyperbolic geometry, 
exact scale models are impossible. 


In fact, each of the above three theorems characterizes hyperbolic geometry. 
If the angle-sum inequality, 


mZA+mZB4+mZC < 180, 


holds, even for one triangle, then the geometry is hyperbolic; and if the angle-sum 
equality holds, even for one triangle, then the geometry is Euclidean; similarly 
for (1) and (3). 

This has a rather curious consequence in connection with our knowledge of 
physical space. If physical space is hyperbolic, which it may be, it is theoretically 
possible for the fact to be demonstrated by measurement. For example, suppose 
that you measure the angles of a triangle, with an error less than 0.0001” for 
each angle. Suppose that the sum of the measures turns out to be 179?59/59.999"', 
The difference between this and 180° is 0.001”. This discrepancy could not be due 
to errors in measurement, because the greatest possible cumulative error is only 
0.0003". Our experiment would therefore prove that the space that we live in 
is hyperbolie. (Granted, of course, that it satisfies the other postulates.) 

On the other hand, no measurement however exact can prove that space is 
Euclidean. The point is that every physical measurement involves some possible 
error. Therefore we can never show by measurement that an equation, 


r+s+t= 180, 
holds exactly; and this is wh 
live in is Euclidean. 

Thus there are two 
hold in physical spac 
unknown. 


at we would have to do to prove that the space we 


possibilities: (1) The Euclidean parallel postulate does not 
e, or (2) The truth about physical space will forever be 
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9.5 THE SPHERICAL MODEL FOR RIEMANNIAN GEOMETRY 


Let V be the surface of a sphere in space. We may as well assume that the radius 
of Vis = 1. A great circle is a circle which is the intersection of V with a plane 
through its center. If T and U are any points of V, then the shortest path on the 
surface joining T' to U is an arc of a great circle. 


Figure 9.5 


We might start to define a kind of “plane geometry” on V by taking the great 
circles as our lines. In this scheme we would take the length of the shortest path 
between each pair of points as the distance between the two points. The resulting 
system has some of the properties that we expect in plane geometry. For example, 
every “line” separates our “plane » into two “half planes,” each of which is convex. 
But the Euclidean parallel postulate fails badly; ie., every two lines intersect. 
Our “geometry” has many other peculiar properties. 

(1) Two points do not necessarily determine a “line.” For example, the north 
and south poles N and 5S lie on infinitely many great circles. 

The same is true for the end points of any diameter of the sphere V. Such points 
are called antipodal. (More precisely, two points A and B of V are antipodal if 
the segment AB passes through the center of V.) 

(2) While our “lines” never come to an end at any point, they are nevertheless 
finite in extent. In fact, if the radius of V is = 1, then the maximum possible 
distance between any two points is m. Thus the ruler postulate cannot possibly 
hold. 

(3) Betweenness, in the form in which we are accustomed to it, collapses com- 
pletely. In fact, given three points of a line it is not necessarily true that one of 
them is between the other two. We may have AB = BC = AC. 

(4) The perpendicular to a line, from an external point, always exists, but is not 
necessarily unique. For example, any line joining the North Pole to a point of the 
equator is perpendicular to the equator. 

(5) Some triangles have two right angles. (In the F ig. 9.5 at the start of this 
section, AANC has right angles at both A and C.) 

(6) The exteriór angle theorem fails. (See the same example.) 
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The only one of these peculiarities that we can avoid is the first. We do this by 
altering the model in the following way. If two points A, B are antipodal, then 
we shall regard them as being the same. To be more precise, a point of our hew 
geometry will be a pair of antipodal points of the sphere V. If A is a point of the 
sphere V, then A denotes the pair (4, A^), where A’ is the other end of the 
diameter that contains A. The points of our Riemannian plane Æ will be the 
pairs A. 


FicunE 9.6 


If L is a great circle on V, then L is the set of all points A for which A is on L. 
The sets L will be.the lines in E. 

The distance d(A, B) between two points A and B is the length of the shortest 
are from A (or A’) to B (or B’). Notice that this may easily be less than the length 
of the shortest arc from A to B. 

In our new geometry, two points A,B always determine a unique "line." The 
reason is that if A and B were antipodal on the sphere, A and B would be the same. 

The Euclidean parallel postulate still fails, of course; two of our new lines always 
intersect in exactly one point. Lines are still of finite extent ; the maximum possible 
distance between two points is now 7/2. Betweenness still does not work. Per- 
pendiculars still are not unique; we still have triangles with two right angles, and 
the exterior angle theorem still fails. 

In fact, in arranging for two points to determine a line, we have introduced a new 
peculiarity: no line separates our Riemannian plane. In fact, if L is a line, and A 
and B are any two points not on L, then there is always a segment which goes 
from A to B without intersecting L. 

In this book, we shall be concerned mainly with Euclidean geometry, but we 
shall devote considerable attention to hyperbolic geometry, mainly because it 
throws light on Euclidean geometry. The point is that these two kinds of geometry 
have so much in common that at the points where they do differ the differences are 
instruetive. On the other hand, the differences between Riemannian and Euclidean 
geometry are so fundamental that it really forms a technical specialty, which is 


remote from our main purpose. We shall not be concerned with it hereafter in this 
book. 


+ 


~~ 
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9.4 SOME QUESTIONS FOR LATER INVESTIGATION 


In this chapter, we have raised more questions than we have answered. 

(1) We have said that the Poincaré model for hyperbolie geometry satisfies all 
of the postulates of Euclidean geometry, with the sole exception of the Euclidean 
parallel postulate. This needs to be proved, and we surely haven’t proved it with 
our conversational discussion in Section 9.2. 

To check these postulates is a rather lengthy job. The reader is warned that this 
sort of verification is discussed rather casually in much of the literature. If the 
models for hyperbolic geometry had in common with Euclidean geometry merely 
the trivial properties that are discussed in semipopular books, they would not 
have the significance which is commonly and rightly attributed to them. 

(2) When the postulates are checked, for the Poincaré model, we will know that 
hyperbolic geometry is just as good, logically, as Euclidean geometry. We con- 
structed the model on the basis of Euclidean geometry. Therefore, if there is a 
mathematical system satisfying the Euclidean postulates, it follows that there is 
a system satisfying the Lobachevskian postulates. 

(3) There remains the if in (2). Is there a system satisfying the Euclidean 
postulates? To prove this, we need to set up a model. We shall see that this can 
be done, assuming that the real number system is given. 


sina) 


ABSOLUTE PLANE GEOMETRY 


10.1 SUFFICIENT CONDITIONS FOR PARALLELISM 

Two lines are parallel if they lie in the same plane but do not intersect. We 
shall use the abbreviation Lı || Zz to mean that the lines Lı and L; are parallel. 
Later on, as a matter of convenience, we shall say that two segments are parallel 
if the lines that contain them are parallel. We shall apply the same term to a line 
and a segment, a segment and a ray, and so on, just as we did somewhat earlier 
in one discussion of perpendicularity. 

The Euclidean parallel postulate will be introduced in the next chapter, and 
used thereafter, except in the chapter on non-Euclidean geometry. The postulate, 


3 
—=-———~+9-__. Á >] 
——————— L 

Figure 10.1 


We shall see, however, from Theorems 1 and 2, that half of this statement can 
be proved on the basis of the postulates that we already have. 


Theorem 1. If two lines lie in the 
same line, then they are parallel. 


RESTATEMENT. Let Ly, L3, and T be three lines, lying in a plane E, such that 
Lı 1 T and L; L T. Then L; || Ls. 


same plane, and are perpendicular to the 
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Figure 10.2 
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Proof. Suppose that Li and Lə intersect T at points Q and P, respectively. 
Suppose that Lı and Lz are not parallel, and let be the point at which they 
intersect. Then there are two perpendiculars to T through R; and this contradicts 
Corollary 1-1 of Chapter 7. 


Theorem 2. Given a line and a point not on the line, there is always at least 
one line which passes through the given point and is parallel to the given line. 


Proof. Let L be the line, let P be the point, and let E be the plane which con- 
tains them. By Theorem 1 of Section 6.5, there is a line T in E which passes through 
P and is perpendicular to L. By "Theorem 4 of Section 8.2, there is a line L'inE 
which passes through P and is perpendicular to T. By the preceding theorem it 
follows that L || L', which was to be proved. 

There is an easy generalization of Theorem 1, which we shall get to presently. 

In the figure below, T is a transversal to the lines Lı and Ls. 

T 


P, L 


0 L 


Ficure 10.3 


More precisely, if Lı, L2, and T are three lines in the same plane, and T intersects 
L, and Lz in two (different) points P and Q, respectively, then T is a transversal 


to Ly and La. 
In the figure below Z1 and Z2 are alternate interior angles; and 23 and Z4 


are alternate interior angles. 


Ficure 10.4 


More precisely, (1) if T is a transversal to Lı and Lo, intersecting Li and Lg in 
P and Q, respectively, and (2) A and D are points of Lı and Lo, respectively, 
lying on opposite sides of T, then ZAPQ and ZPQD are alternate interior angles. 
(Under a change of notation, this definition says also that ZCQP and ZQPB 


are alternate interior angles.) 
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Theorem 3. Given two lines and a transversal. If a pair of alternate interior 
angles are congruent, then the lines are parallel. 


The proof is exactly like that of Theorem 1. 


In the figure below, Z1 and 21’ are corresponding angles, Z2 and 22’ are 
corresponding angles, and so on. 


Figure 10.5 


DEFINITION. If Zx and £y are alternate interior angles, and Zy and Zz are 
vertical angles, then Z: and Zz are corresponding angles. 


Theorem 4. Given two lines and a transversal. If a pair of corresponding angles 
are congruent, then a pair of alternate interior angles are congruent. 


Theorem 5. Given two lines and a transversal. If a pair of corresponding angles 
are congruent, then the lines are parallel. 


10.2 THE POLYGONAL INEQUALITY 
The triangular inequality states that for any triangle ABC we have 


AB + BC > AC. 


If 4 B, and C are not required to be noncollinear or even different, we get a weaker 
result. 


Theorem 1. For any points A, B, C, 


AB + BC 2 AC. 


Proof. If A, B, and C are noncollinear, this follows from the triangular in- 
equality. If A, B, and C are collinear, we take a coordinate system on the line 
that contains them and let their coordinates be z, y, and z. Let 


a=r-—y, b=y—z. 
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By Theorem 12, Section 1.4, we know that 


la| + Ibl 2 la + bl, 
therefore 
je — yl + ly —4l & le — z 
Hence 
AB + BC 2 AC, 
which was to be proved. From this we get the following theorem. 
Theorem 2. The Polygonal Inequality. Tf A, Ao,...,An are any points 
(n > 1), then 


Aiz + Aå t+ t + AraAra Arin 


The proof is by induction. 
We shall need this result in the following section. For the first time, we are also 


about to use the Archimedean postulate for the real number system, given in 
Section 1.8. This says that if e > 0 and M > 0, then ne > M for some positive 


integer n. 


10.3 SACCHERI QUADRILATERALS 


tion 4.4, the definition of a quadrilateral. Given four points 
e sa but no three of them are 


collinear. If the segments AB, BC, CD, DA intersect only at their end points, 


then their union is called a quadrilateral, and is denoted by []ABCD. The segments 
AB, BC, CD, DA are the sides of (JABCD, and the segmen 
diagonals. The angles of (JABCD are ZABC, ZBCD, ZCDA, and ZDAB; 
they are often denoted briefly as 2B, ZO ZDA: If all four of the angles are 
right angles, then the quadrilateral is a rectangle. 

On the basis of the postulates that we have so far, without the use of the parallel 
postulate, it is impossible to prove that any rectangles exist. If we try, ina plausible 
fashion, to construct à rectangle, we get what is called a Saccheri quadrilateral. 


B C 
A D 
Ficure 10.6 


The definition is suggested by the markings on the figure above. To be precise, 


(ABCD is a Sacchert_ quadrilateral if ZA and ZD are right angles and AB = 
CD. The segment AD is called the lower base; and BC is called the upper base. 
The lower base angles are ZA and ZD;and ZB and ZC are the upper base angles. 
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Theorem 1. The diagonals of a Saccheri quadrilateral are always congruent. 
Proof. By SAS, we have ABAD = ACDA. Therefore BD S= AC. 
B C 


FicunE 10.7 


Roughly speaking, the following theorem states that a Saccheri quadrilateral 
is completely described, geometrically, by the distances AD and AB. 


Theorem 2. Let []ABCD and []A'B'C'D' be Saccheri quadrilaterals, 
with lower bases AD and A'D'. If A'D = AD and AP œ= AB, then 
BC = B'C', ZB' = ZB and ZO = ZC. 


Figure 10.8 


Proof. The main steps in the proof ure as follows. 

(1) AACD S AA'C'D' (by SAS). 

(2) ZA = ZA’ (all right angles are congruent). 

(3) ZBAC & ZB'A'C". 

(4) AC = AC. 

(5) AABC S AA'B'C'. 

(6) ZB ZB’. 

(7) BC = BC. 

(8) ZC = ZC’. 

Applying this theorem to the Saccheri quadrilaterals []ABCD, [.]DCBA, we 
get ZB = ZC. Thus we have the following theorem. 


Theorem 3. In any Saccheri quadrilateral, the upper base angles are congruent. 


Theorem 4. In any Saccheri 


quadrilateral, the upper base is congruent to or 
longer than the lower base. 
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RESTATEMENT. Given a Saccheri quadrilateral [ ]A ;B4B5À5, with lower base 
A142. 'Then B4B; z Aido. 


Proof. Let us set up a sequence of n Saccheri quadrilaterals, end to end, start- 
ing with the given one, like this: 


B, B; B; B, Bia Bn Bn+t 
A as A; du da, cde V Ang 
Ficure 10.9 


That is, Ag, Aa,---, Anti are points of the line A, Az, appearing in the stated 
order on Å, Az; the angles 2B2A2A3, Z B3À3A4, ... and so on are right angles; 


AyAg = Aods = Asda = c = AntAn = Arit 
and 
AB = AsBs = -e = A,B, = An41Bn41- 


By Theorem 2, we have 
B,B2 m B3B; erii dial B4 .1Bs = B,Bn41- 


We don’t happen to know anything about the question of collinearity for the 
points By, Bo,..., Bn4i. But we know by the polygonal inequality that 
B,Bn41 £ BiB2 + BaB; + -++ + Bn—1Bn + ByBn+1- 
Since all of the distances on the right are =B,Bz, we have 
B,Bnyi < n: BBs. 
By the same principle, we get 
AiÁAn4i S AiBi + By Bnyi + BapiAnti € A,B, + nBiBo + A,B. 


Since A;An41 = n4145, we have 


nA,Aq € nB,B2 + 2A,B;, 


and this conclusion holds for every n. , 
Now suppose that our theorem is false. Then A142 > BıB2, so that A145 — 
B,Bz is a positive number. Obviously, 241B; is a positive number. Let 


e= A142 -n B4B5, and M= 2A,B,. 


Then e > 0 and M > 0, but ne S M for every positive integer n. This con- 
tradicts the Archimedean postulate, and so completes the proof. 


128 ABSOLUTE PLANE GEOMETRY 
10.4 THE BASIC INEQUALITY FOR ANGLE-SUMS IN A TRIANGLE 


A well-known theorem of Euclidean geometry asserts that, in any triangle, the 
sum of the degree measures of the angles is exactly 180. Without the parallel 
postulate, we shall show that this sum is always less than or equal to 180. We shall 
need some preliminaries. 


Theorem 1. In any Saccheri quadrilateral LJABCD (with lower base AD), we 
have ZBDC = ZABD. 


FiavnE 10.10 


Proof. We know that BA — DC and BD = BD. If it were true that ZABD > 
ZBDC, then it would follow by Theorem 6, Section 7.1, that AD > BC; and this 


would contradict Theorem 4, Section 10.3. Therefore ZABD S ZBDC, which 
was to be proved. 


From this we get an immediate consequence for right triangles. 


Theorem 2. If AABD has a right angle at A, then 
mZB+mZD x 90. 


Proof. Let C be a point such that L]ABCD is a Saccheri quadrilateral: 
B [^j 


Ficure 10.11 
Then mZ3+mZ2 = 90, 
because ZADC is a right angle. By the preceding theorem, mZ2 > m/1. 
Therefore 
90 —mz3 = mZ1, and  mZ14 mZ3 x 90, 
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which was to be proved. Thus we have: 


Theorem 3. Every right triangle has only one right angle; and its other two 
angles are acute. 


Of course we could have proved this long ago, merely by applying the exterior 
angle theorem. But we shall need Theorem 2 for other purposes anyway. 

We can now define the hypotenuse of a right triangle as the side opposite the 
(unique) right angle. The other two sides are called the legs. 


Theorem 4. The hypotenuse of a right triangle is longer than either of the legs. 


Because the angle opposite the hypotenuse is larger. (See Theorem 3, 
Section 7.1.) 


In a triangle, two sides can easily be congruent. Therefore we cannot always 
speak of the longest side. We can, however, always speak of a longest side; this 
means a side at least as long as any other side. 


Theorem 5. In AABC, let D be the foot of the perpendicular from B to AC. 
If AC is a longest side of A ABC, then A-D-C. 


B 


A D Cc 
Fieure 10.12 


(This theorem, if available, would have simplified the proof of the SSS theorem.) 

Proof. Suppose that the theorem is false. Then we have D = A, D-A-C, 
D = Cor A-C-D. We need to show that all of these cases are impossible. Since 
the latter two cases are essentially the same as the former two, it will suffice to 
show that D = A and D-A-C are impossible. 


B 


A=D c 
Figure 10.13 


If A = D, then AABC isa right triangle with right angle at A; therefore 
BC > AC, and AC is not a longest side. 
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If D-A-C, then AC < DC. Also DC < BC, because BC is _the hypotenuse of 
ABDC, and DC is one of the legs. Therefore AC < BC, and AC is not a longest 
side of AABC. 


Finally, we can prove the theorem that we were working toward. 
Theorem 6. In any triangle ABC, we have 
mZA+mZB+mZC < 180. 


Proof. Suppose, without loss of generality, that AC is a longest side of A ABC; 
and let BD be the perpendicular segment from B to the line AC. By the preced- 
ing theorem we have A-D-C; this means that D is in the interior of Z ABC. 


B 


A D E 
Ficure 10.14 


We now apply Theorem 2 to each of the right triangles A ADB and ABDC. Thus 


mZA-+mZABD < 90 
and 


mZDBC -- mzC x 90. 
Therefore 


mZA + mZABD + mZ DBC +mZC S 180. 
Since D is in the interior of Z ABC, we have 


mZABD +-mZDBC = mZB. 
Therefore 


mZA -- mZB --mzC < 180, 
which was to be proved. 


10.5 A HISTORICAL COMEDY 


Saccheri quadrilaterals are named after the Italian geometer Gerolamo Saccheri 
(1667-1733). Like most geometers of his time, Saccheri was dissatisfied with the 
situation of the parallel postulate; he believed that this statement ought to be 
proved asa theorem. He wrote a rather long book, entitled Euclides ab omne naevo 
vindicatus, in which he undertook to “vindicate Euclid of every blemish” by 
showing that the parallel postulate was a consequence of the other postulates of 


synthetic geometry. On certain rare oc ions, the development of mathematics 
involves high comedy. This was one of them. 
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— In the first place, Saccheri’s “proof” of the postulate was fallacious. However, 
the early stages of the proof were quite correct, and the preliminary theorems were 
oth new and important. If you omit the erroneous part of his book, what you get 
is the first treatise on what is now called absolute geometry. That is, Saccheri 
developed an extensive geometric theory which was independent of the whole 
"question of the parallel postulate. (The preceding portions of this chapter are a 
‘sample of this sort of theory.) For this achievement, Saccheri is highly honored 
‘and justly so. Prosaic accuracy is not an essential virtue in mathematics any 
more than it is in any other sort of creative work; what really matters is the 
"quality of a man’s positive contribution. 

| The final irony is that if Saccheri’s enterprise had really sueceeded in the way he 
thought it had, no modern mathematician would have regarded his book as a 
vindication of Euclid. From a modern viewpoint, a proof of the parallel postulate 
would merely show that the postulate was redundant; and redundancy is not 
thought of as a virtue in a set of postulates. There are two main things that a 
modern mathematician wants to know about a postulate set: (1) The postulates 
ought, by all means, to be consistent, in the sense that none of them contradicts 
the others. If this condition does not hold, then any mathematical theory based 
on the postulates is, quite literally, much ado about nothing, because, in this 
ease, there isn’t any mathematical system that satisfies the postulates. The only 
way to show that a set of postulates is consistent is to show that there is a system 
- in which all of the postulates are satisfied. Such a system is called a model for the 
. postulate set. (2) The postulates ought, if possible, to be independent, in the sense 
that no one of them is deducible from the others as a theorem. Any postulate which 
is so deducible is called redundant. To show that a particular postulate is redundant, 
you have to prove it on the basis of the others; and this is what Saccheri tried to 
do for the parallel postulate. To show that a particular postulate is independent 
of the others, you have to show that there is a mathematical system in which all 
of the other postulates are satisfied, but in which this particular one is not. (See, 
for example, Section 6.4, in which we showed that the SAS postulate is independent 
of the postulates that precede it.) 

In the nineteenth century, two fundamental questions were settled. First, it 
was shown that the postulates of synthetic geometry, including the parallel 
postulate, were consistent—granted, of course, that the real number system is 
consistent. It was shown further that the parallel postulate is independent of the 
others. This was done, in the only way that it could be done, by the discovery of 
“geometries” in which all the synthetic postulates except the parallel postulate 


were satisfied. 
These two later developments were the real vindication of Euclid from a modern 


viewpoint. 


| = 


THE PARALLEL POSTULATE AND 
PARALLEL PROJECTION 


11.1 THE UNIQUENESS OF PARALLELS 


The Euclidean postulate is as follows. 


P-1. Given a line and an external point, there is only one line which passes 
through the given point and is parallel to the given line. 


This gives us immediately a converse of Theorem 3, Section 10.1. 


Theorem 1. Given two lines and a transversal. If the lines are parallel, then 
each pair of alternate interior angles are congruent. 


Figure 11.1 


Proof. There is exactly one line Li, through P, for which the alternate interior 
angles are congruent, and by Theorem 3, Section 9.1, we have Li || La. | Since 


there is only one such parallel, we have Li = Ly. Therefore Z1 = 22, which 
was to be proved. 


The proof of the following theorem is entirely analogous. 


Theorem 2. Given two lines and a transversal, 
each pair of corresponding angles is congruent. 


The inequality mZ A +.mZB +mZC s 


If the lines are parallel, then 


180 now becomes an equation. 


Theorem 3. In any triangle A ABC we have 


mZA+mZB+mZC = 180. 
" 132 
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Proof. Let L be the parallel to AC through B. Let Dand E be points of L, 
such that D-B-E, and such that D and C are on opposite sides of AB. Then 


mZ2 + mZB = mZDBC, D B E 
and L 
mZDBC + mZ1 = 180. 4 ; 

Therefore 

mZ1 + mZB + mZ2 = 180. 
By Theorem 1, 

mZ1 = mZC and mZ2 = mZA, 

therefore Figure 11.2 

mZA+mZB+mZC = 180, 


which was to be proved. The following theorems are an immediate consequence. 
Theorem 4. The acute angles of a right triangle are complementary. 


Theorem 5. Every Saccheri quadrilateral is a rectangle. 


B C 
ui [ ] 
A D 
Ficure 11.3 


Proof. By Theorem 1, 22 = ZA. Since AB = DC and AC = AC, it follows 
that ABAC = ADCA. Therefore 21 = Z3. Since mZ1 + mZ2 = 90, it 
follows that mZ4 -+ mZ3 = 90; that is, ZBCD is a right angle. The proof 
that ZB is a right angle is obtained merely by permuting the notation. Thus we 
have finally shown that rectangles exist. 

Note that in this proof we are relaxing a bit in the style of the exposition by 
using a figure to explain the notation. If the reader (or the writer) sees no other 
way to explain, say, the idea of alternate interior angles, then it is worthwhile to 
fight our way through the problem as we did in the previous chapter. But once 
we have done this, we have earned the right to speak in the abbreviated language 
of pictures. 

A quadrilateral is a trapezoid if at least one pair of opposite sides are parallel. 
(It is sometimes required that the other pair of opposite sides be nonparallel, but 
this is rather artificial, just as it would be artificial to require that an isosceles 
triangle be nonequilateral.) If both pairs of opposite sides of a quadrilateral are 
parallel, then the quadrilateral is a parallelogram. If two adjacent sides of a 
parallelogram are congruent, then the quadrilateral is a rhombus. The proofs of 
the following theorems are omitted. (They are not much harder to write than 


to read.) à 
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Theorem 6. For any*triangle, the measure of an exterior angle is the sum of the 
measures of its two remote interior angles. 


Theorem 7. In a plane, any two lines parallel to a third line are parallel to each 
other. 


Theorem 8. If a transversal is perpendicular to one of two parallel lines, it is 
perpendicular to the other. 

Theorem 9. Hither diagonal divides a parallelogram into two congruent triangles. 
More precisely: If (]ABCD is a parallelogram, then A ABC = ACDA. 


Theorem 10. In a parallelogram, each pair of opposite sides are congruent. 


Theorem 11. The diagonals of a parallelogram bisect each other. 


That is, they intersect at a point which is the bisector of each of them. Thus 


the proof must begin with a proof that the diagonals intersect each other (see 
Theorem 1, Section 4.4). 


11.2 PARALLEL PROJECTIONS 


We know, by Theorem 4, Section 8.2, that the perpendicular from a point to a 
line always exists and is unique. 


R E 
9 i 
P | | 
| CEN 
| | | 


eeey 
P'-f(P) Q'-f(Q) R'=f(R) 
Fieure 11.4 


Thus, given two lines L, L’ in the same plane, we can define the vertical projection 
of L into L’. This is the function 


f:LoL 


under which to each point P of L there corresponds the foot P’ = f(P) of the 
perpendicular from P to L’. In fact, the vertical projection can be defined equally 
well for the case where the lines are not necessarily coplanar; and the definition 
is exactly the same. This degree of generality, however, will not concern us. Note 
also that the existence and uniqueness of the vertical projection do not depend 
on the parallel postulate. We do, however, need this postulate to define and in- 
vestigate the more general notion of parallel projection. Under this more general 
scheme, instead of following the perpendicular, to get from P to P’ = f(P), we 
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proceed in any direction we want, providing, however, that we always go in the 
same direction for every point P of L. More precisely, the definition of parallel 
projections is as follows. 

Given two lines L, L’ and a transversal T. (By definition of a transversal, this 
means that all three of our lines are coplanar.) Let T intersect L and L’ in points 
Q and Q', respectively. 


FicunE 11.5 


Let f(Q) be Q'. For every other point P of L, let Tp be the line through P, parallel 
to T; let P’ be the point where Tp intersects L’; and let f(P) be P’. This defines 
a function 

f:L> Lr. 


Of course, we should check that for each point P of L, the corresponding point 
P' exists and is unique; if either of these conditions failed, then the directions that 
we have just given would not define a function. However, the verifications are 
simple. 

(1) If Tp were parallel to L’, then it would follow that T || L’, which is false, 
because T is a transversal to L and L’. Therefore Tp intersects L’ in at least one 
point P’. 

(2) If Tp = L’, it follows that L’ || T, which is false. Therefore Tp intersects 
L’ in at most one point P'. 

Thus to each point P of L there corresponds exactly one point f(P), and we really 
have a function. This function f is called the projection of L onto L' in the direction 
m, 


Theorem 1. Every parallel projection is a one-to-one correspondence. 


Proof. Given 
f:L 9 UL, 


the projection of L onto L’ in the direction T. (See Fig. 11.5.) Let g be the pro- 
jection of L/ onto L in the direction T. Obviously g reverses the action of f; that is, 
if P = g(P’), then P’ = f(P). Therefore f has an inverse 

yb egi eb 


Therefore f is a one-to-one correspondence L €» L’, which was to be proved. 
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(Another way of putting it is to say that every point P’ of L’ is = f(P) for one — 
and only one point P of L. It may be worthwhile, at this stage, to review the 
discussion of functions in Section 3.1.) a 


Theorem 2. Parallel projections preserve betweenness. 


RESTATEMENT. Let f:L +> L' be a parallel projection. If P-Q-R on L, then 
P'-Q'-R' on L'. 


f ! ] 

f QE 
Tp To Tn 
Fieure 11.6 


Here, of course, P” = f(P), Q’ = f(Q), and R’ = f(R). 
Proof. Let Tp, To, Tr be as in the definition of a parallel projection, so that 
Tp || To || Tr. 


Then E and R’ are on the same side of To, because RR’ does not intersect TQ. 
Similarly, P and P' are on the same side of Te. But P and R are on opposite sides 
of To, because P-Q-R. By two applications of Theorem 2, Section 4.2, P’ and R’ 
are on opposite sides of To. Therefore P/R/ intersects T'o in a point X. Since 
To »* L’, there is only one such point of intersection. Therefore X = Q'. Therefore 
Q' lies on P'E', and P’-Q’-R’, which was to be proved. 


Theorem 3. Parallel projections preserve congruence. 


Restatement. Let f:L €» L' be a parallel projection. If "AB œ CD on L, 

then A'B’ = C'D' on L’. 

Proof. (1) If L || L’, then AB and A’B’ are opposite sides of a parallelogram. 
By Theorem 10, Section 11.1, it follows that AB = AB’. Similarly, CD = CD. 
Therefore A'B’ 2 CD’, as desired. 

(2) Suppose that L and L/ are not parallel, as in Fig. 11:7. Let V be the line 
through A, parallel to L’, intersecting T' at E. Let W be the line through C, 
parallel to Z’, intersecting T' at F. j 

Now V || W, and L is a transversal. For appropriate choice of the notation for 
Cand D, Z1 = ZBAB and Z1' = ZDCF are corresponding angles. (This is 
true for the case shown in the figure; if it isn’t true, we interchange the letters C 
and D.) By Theorem 2, Section 10.1, we have 


412 Z]. 
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FiavnE 11.7 
For the same reasons, 
Z2 ce Z2'. 
Since AB & CD by hypothesis, it follows by ASA that 
AABE = ACDF. 
Therefore AE & CF. But AE = A'B' and CF = CD’, because these segments 
are opposite sides of parallelograms. Therefore A'B’ & C'D', which was to be 


proved, 
Consider now three parallel lines with two common transversals, like this: 


Figure 11.8 
It ought to be true that 

A'B' AB 

BO BO 


In the style of our previous theorems, we could convey this by saying that parallel 
projections preserve ratios. In fact, this is true, but the proof is rather hard, and will 
be given in the following two sections. The theorem is well worth working for; 
it is the foundation of the whole theory of similarity for triangles. The proof 
will depend on the Euclidean parallel postulate, as one might expect: if parallels 
are not unique, then parallel projections are not even well defined. 
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11.3 THE COMPARISON THEOREM 


The algebraic method that we shall use to prove that parallel projections pre- 
serve ratios is going to be rather peculiar. It will be based on the following theorem. 


Theorem 1. The Comparison Theorem. Let x and y be real numbers. Suppose 
(1) that every rational number less than z is also less than y, and (2) every 
rational number less than y is also less than z. Then x = y. 


The proof is easy on the basis of Theorem 2, Section 1.8. Suppose that x < y. 
Then there is a rational number p/q, between z and y. Thus 


a<Pcy, 


and p/q is less than y, but not less than x. This contradicts (2). Similarly, if 
y < x, we have 
y< r <Ẹ 


for some rational number p/g; this contradicts (1). Therefore z = y, which was 
to be proved. 


This seemingly trivial observation turns out to be surprisingly powerful, as we 
shall see. 


11.4 THE BASIC SIMILARITY THEOREM 


The purpose of this section is to show that parallel projections preserve ratios. 
Let us first consider the special case indicated by the following figure, and treated 
in the following theorem. 


Figure 11.9 


Here Lı, Le, and Ls are three parallel lines, with common transversals T and T”. 
We want to prove that 

BC _ BC 

AB AB 
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Theorem 1. Let Lı, Lz, and Lg be three parallel lines, with common transversals 
T and T" intersecting them in points A, B, C and A’, B', C’. If A-B-C (and 
A'-B'-C^), then 


BC _ BC 
AB ABP 
Proof. Let 
BC B'C* 


t€ qe Yu 


Let p and q be any two positive integers. 
(1) First we divide AB into q congruent segments, end to end, as in the figure: 


T gr 


Fieure 11.10 


That is, we take a sequence of points 
A= Ao Ap ^; Aa = B 


in the stated order on the ray AB, so that the length of each of the resulting seg- 
ments is AB/q. e 

(2) Next we lay off, on the ray BC, a sequence of p segments, of the same length 
AB/q. 'The end points of these segments are 


B Bs Boss. 


(3) Now we project each of the points A;, B; onto T”, in the direction L,, thus 
getting the points Aj, Bj on T". 
Since all of the little segments on T are congruent, we have 
BB, _ p, 
AB q 
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Since parallel projections preserve congruences, all the little segments on 7" are 
congruent. Therefore BB, p 


A'B 7 q 


We can now complete the proof easily, in two steps. 
(4) Suppose that 


Po pe BA 
cot a 
Then 
p a < BC 
Therefore 
BB, < BC. 
Hence 
B-B,-C, 
as indicated in Fig. 11.10, and 
B'-B;-C', 
because parallel projections preserve betweenness. 
Therefore 
A'B' 
B'B « B'C', p. X < BG 
and 
Ro BC. 
q XE 
(Here we have merely reversed the steps that led from 
p. BC 
q E AB 


to B-B,-C.) Thus we have proved that if p/g « z, then p/q € y. 
(5) By exactly the same sort of reasoning, we conclude that if p/q < y, then 
p/q < z. 


It follows by the comparison theorem that z = y, which was to be proved. 
We extend Theorem 1, by various tricks, to get the general case. Consider any 
four points on T, and the corresponding points on 7”, under a parallel projection: 


Figure 11.11 
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Here a, b, c and so on are the lengths of the indicated segments. By two applica- 
tions of Theorem 1, we have 


Therefore we have 


so that 


Stated in words our result is as follows. 


Theorem 2. If two segments on the same line have no point in common, then 
the ratio of their lengths is preserved under every parallel projection. 


From this it is easy to prove our main theorem. 


Theorem 3. Parallel projection preserves ratios. 


RESTATEMENT. Let T and T” be lines. Let A, B, C and D be any points of T, 
and let A’, B', C', and D' be the corresponding points of 7", under a parallel 
projection. Then 
An "Aup 9 g E ED. 
CD ODA XP CD 
Proof. Let XY be a segment which has no point in common with AB or CD, 
and let X"Y^ be the corresponding segment on T', under the parallel projection. 
Then i 
AB XY MCL 
AB o XY HOE: 
by Theorem 2. Therefore 


AB | CD 
AB CD 

and 
AB | A'B' 
OD OD 


which was to be proved. 


ime |Z 


SIMILARITIES BETWEEN TRIANGLES 


12.1 PROPORTIONALITIES 


Given two sequences 
E ty Nee | 
C050 iets ay b^ os. 


of positive numbers. If it is true that 
ao Ep sh 


pe Fens 
then we say that the two sequences are proportional, and we write 


CE EC a RR 
The constant ratio, 


, 
a 
kr m ca E49 


is called the proportionality constant. Note that proportionality is a symmetric 
relation. That is, if 


+ , , 
a, b, Cp wis Oy Cry aches 


then 
9^, 0,6... a b, esu... 


and conversely. Note, however, that the proportionality constant depends on the 
order in which the sequences are written. If we reverse the order, we get a new 
constant which is the reciprocal of the old one. 


To work with proportionslities, we merely express them in terms of equations 
between fractions, and then use the ordinary rules of algebra. 
A sample theorem follows. 


Theorem. If a, b ~ c, d, then a, € ~ b, d, and conversely, 


Proof. The first proportionality means that 


and the second means that 
bed 
"im 


Trivially, these equations are equivalent. 
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This being the case, the question arises why the notation deserves to be in- 
troduced at all. The reason is that the “~” relation is often easy to read off from 
a figure. Moving a little ahead of ourselves, for the sake of illustration, consider 
a pair of similar triangles, like this: 


gre 
w y 
d 
x 


Figure 12.1 


Writing the lengths of the sides in the proper order, we get the proportionality 
a,b,c ~ Ww, y, x. 


We have now made the transition from geometry to algebra, and can proceed to 
work algebraically with fractions, starting with the equation 


12.2 SIMILARITIES BETWEEN TRIANGLES 


Given A ABC, ADEF and a correspondence 


ABC — DEF. 
B E 
T d 
[4 a 
D e F 
A b C 
Figure 12.2 


We use the familiar convention, under which a is the length of the side opposite 
ZA, and so on. If 
a, b, [enl d, Ed 


then we say that corresponding sides are proportional. If corresponding sides are 
proportional, and every pair of corresponding angles are congruent, then we say 
that the correspondence is a similarity, and we write 


AABC ~ ADEF. 
If there is a similarity between two triangles, then we say that the triangles 


are similar. (As in the case of congruences and congruence, the occasions when 
this is what we really mean are exceedingly rare.) 
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We remember that, from the expression 
AABC = ADEF, 
we could read off—without further reference to a figure—three angle-congruences 
ZA = ZD, ZB S ZE, LC S LF 
and three segment-congruences 


AB = DE, AC=DF, BC = EF. 


I 


In the same way, from the expression 
AABC ~ ADEF, 
we can read off the same three angle-congruences, and the proportionality 
AB, AC, BC ~ DE, DF, EF. 


* To get the right-hand half of this expression, we simply replace each letter A, 
B, or C on the left by the corresponding letter D, E, or F. 
Intuitively speaking, two iriangles are similar if they have the same shape, 
although not necessarily the same size. It looks as if the shape ought to be deter- 
mined by the angles alone, and this is true. 


e 
Theorem 1. The AAA Similarity Theorem. Given a correspondence between 
two triangles. If corresponding angles are congruent, then the correspondence 
is a similarity. 
Restatement. Given AABC, ADEF and a correspondence 
ABC — DEF. 
If ZA S ZD, ZB S ZE, and ZC c ZF, then 
AABC ~ ADEF. 


Proof. Let E' and F’ be points of AB and AC, such that AE' = f and AF’ = e, 
as shown in Fig. 12.3. By SAS, we have 


AAE'F' = ADEF. 


Therefore ZAE'F' = ZE. Since ZE c ZB, we have Z AE'F' = ZB; thus 
ET || BC, and A, F', and C correspond to A, E', and B under a parallel projec- 
tion. Since parallel projections preserve ratios, we have 
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Figure 12.3 


In exactly the same way, merely changing the notation, we can show that 


5 E p 
AG, -BER 
therefore 
d, e, f ~ BC, AC, AB, 
and 


d, e, f ~ a, b, c. 


Hence corresponding sides are proportional, and the correspondence ABC «+ DEF 
is a similarity, which was to be proved. 

Of course it follows from our angle-sum theorem that if two pairs of correspond- 
ing angles are congruent, so also is the third pair. Thus we have the following 
theorem. 


Theorem 2. The AA Similarity Theorem. Given a correspondence between two 
triangles. If two pairs of corresponding angles are congruent, then the correspond- 
ence is a similarity. 


We also have a sort of converse of Theorem 1. 
Theorem 3. The SSS Similarity Theorem. Given two triangles and a correspond- 


ence between them. If corresponding sides are proportional, then corresponding 
angles are congruent, and the correspondence is a similarity. 


Restatement. Given AABC, ADEF, and a correspondence ABC €» DEF. If 


a, b, ec d, €; f, 
then 
AABC ~ ADEF. 


Proof. Let E' be the point of AB for which AE = f (Fig. 12.4). Let L be the 
line through Æ’, parallel to BC. If L || AG, then BÓ || AC, which is false. There- 
for L intersects AC at a point F’. 
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Figure 12.4 


Now Z AE'F' = ZB, because these are corresponding angles; and ZA c ZA, 
Therefore 


AAE'F' ~ AABC; 


hence 
f, AF’, E'F' ~ c, b, a. 
Therefore 
$7 4p pin and A=", plu 
But 
5,6, d ~ c, b, a. x 
Thus 
ebd a M - af 
Vim ale and AES Sm e 


By SSS, we have 

AAE'F' = ADEF. 
Therefore 

¿ADEF ~ AABC, 


which was to be proved. 
Next we bave an analogue of SAS. 


Theorem 4. The SAS Similarity Theorem. Given a correspondence between two 


triangles. If two pairs of corresponding sides are proportional, and the included 
angles are congruent, then the correspondence is a similarity. 


RESTATEMENT. Given AABC, ADEF, and the correspondence ABC + DEF. 
If ZA S Z D, and b, c ~ e, f, then AABC ~ ADEF. 
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Figure 12.5 


Proof. Let E' be the point of AB for which AE’ = f. Let L be the line through 
E', parallel to BÓ. Then L intersects AG in a point F’. The main steps in the proof, 
from here on, are as follows. You should be able to supply the reasons in each case. 

(1) AAE'F' ~ AABC. 

(2) b, c ~ AF", f. 

(3) AF' — e. 

(4) AAE'F' = ADEF. 

(5) AABC ~ ADEF. 


Theorem 5. Given a similarity between two triangles. If a pair of corresponding 
sides are congruent, then the correspondence is a congruence. 


Proof? This was really a step in the proof of the preceding theorem. — 
An altitude of a triangle is & perpendicular segment from a vertex to the line con- 
taining the opposite side. 


Fieure 12.6 


As the figure indicates, every triangle has three altitudes. We shall use the same 
word altitude for the length of such a perpendicular segment. In a right triangle, 
the altitude to the hypotenuse is always an “interior altitude,” like this: 


B 


4) 


A D c 
Figure 12.7 


That is, if ZB is a right angle, and BD L AC, then A-D-C. (This follows from 
Theorem 4, Section 10.4). 
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12.3 THE PYTHAGOREAN THEOREM 


To prove the Pythagorean Theorem, we need one preliminary result. 


Theorem 1. The altitude to the hypotenuse of a right triangle divides it into two 
triangles each of which is similar to it. 


RESTATEMENT. Let AABC be a right triangle with its right angle at C and let 


D be the foot of the perpendicular from C to AB. Then AACD ~ AABC ~ 
ACBD. 
G 


A D B 
Figure 12.8 


(To remember the way these similarities work, all you need to do is to observe 
that there is only one way that they might work. In the first correspondence, we 
must have A €» A, because Z A is common to A ACD and AABC. Also we must 
have D +> C because these points are where the right angles are. Finally, we must 
have C + B, because at this stage C has nowhere else left to Ego. Therefore the 
correspondence must be ACD + ABC; and similarly for the second similarity.) 

Proof. Obviously ZA = ZA. And ZADC z ZACB, because both of these 
are right angles. By the AA similarity theorem, we have 


AACD ~ AABC. 
The proof of the other half of the theorem is exactly the same. 
Theorem 2. The Pythagorean Theorem. In any right triangle, the square of the 
length of the hypotenuse is the sum of the squares of the lengths of the other 
two sides. 
Restatement. Let AABC be a right triangle with its right angle at C. Then 
a? + b? = c?, 


Here we are using the usual notation for lengths of opposite sides, 


Fieure 12.9 
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Proof. By the preceding theorem, 


AACD ~ SABC ~ ACBD. 
Therefore 
h, f, b oe a, b, c ET g, h, a. 


We shall calculate f and g in terms of a, b, and c, and then use the fact that f + 
g = c. (Query: What theorem do we need to know, to conclude that f +g = c?) 
Step 1. Since 


i Mi 
pea 
we have 
b? 
Lien 
And since 
gn. 
p UR 
we have 
a? 
Cras cade 
Step 2. Therefore 
2 2 
b 
Cirad s c c 


Therefore 
a? + b? = c, 
which was to be proved. 

Legend has it that when Pythagoras discovered this theorem, he sacrificed an 
ox as a thank offering. (The legend is doubtful; and it is not even known whether 
the theorem was proved by Pythagoras personally.) The German poet, Heinrich 
Heine, remarked that ever since this sacrifice, the oxen have trembled whenever 
a great truth was discovered. 

The proof given here is not the one given in Euclid. Euclid's proof (to be dis- 
cussed later) made heavy use of the theory of area. During the last two thousand 
years or so, the literature of the Pythagorean theorem has become immense. 
Literally hundreds of proofs have been given. The converse of the Pythagorean 
theorem is also true, and its proof is easy. 

Theorem 3. Given à triangle whose sides have lengths a, b, and c. If a? 4- 

p? = c?, then the triangle is a right triangle, with its right angle opposite the 

side of length c. 

Proof. Given AABC, with a? + 6? = c?. Let ZF be a right angle, and let 


D and E be the points on the sides of ZF such that FE = a and FD = b (Fig. 
12.10). By the Pythagorean theorem, 


DE? = a? + V’. 


DE = Va? 4- b? — c. 


Therefore 
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A 


Ficure 12.10 


By SSS, AABC £& ADEF. Therefore AABC is a right triangle, with its right 
angle at C. 

For right triangles, we have a special congruence theorem and also a special 
similarity theorem, 


Theorem 4. The Hypotenuse-Leg Theorem. If the hypotenuse and a leg of one 
right triangle are congruent to the hypotenuse and a leg of another, then the 
triangles are congruent. 


RESTATEMENT. Given AA BC, with a right angle at C, and AA'B'C', witha right 
angle at C'. If a = a' and c = ce’, then AABC zx AA'B'C'. 
Proof. By the Pythagorean theorem, 


b= Ve? — a 
and 


b = Ve? — a, 


Since a = a’ and c = c, we have b = y, By SSS, AABC c AA'B'C'. 


Figure 12.11 


5 
d 
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T 
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If the theory of plane area is set up by using postulates, then we can quite well 
use area to prove that these equations hold. Later on, however, we shall want to 
prove that simple figures such as triangles have areas, and that areas have the proper- 
ties that we might expect. And for this purpose we want to prove the following 
theorem without appealing to the theory of area. 


Theorem 5. In any triangle, the product of a base and the corresponding altitude 
is independent of the choice of the base. 
Restatement. Given AABC. Let AD be the altitude from A to BC, and let 
BE be the altitude from B to AC. Then 


AD: BC = BE- AC. 


FicunE 12.12 


Proof. Suppose that E # C and D # C, as shown in the figure. Then ZC — 
ZC, and Z BEC = ZADO, because both are right angles. Therefore ABEC ~ 


^ 
den ^ BE, BC ~ AD, AC. 
Thus 
AD _ AC 
BE BC’ 
and 


AD- BC = BE- AC, 

which was to be proved. 
If E = C, then AABC is a righ 
have D = C. A 


t triangle with its right angle at C and we also 


E-C a B 


FicvnE 12.13 


In this case, the theorem says trivially that ab — ba. 


Theorem 6. For similar triangles, the ratio of any two corresponding altitudes 


is equal to the ratio of any two corresponding sides. 
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RESTATEMENT. Suppose that AABC ~ AA'B'C. Let h be the altitude from 
A to BÓ, and let h’ be the altitude from A’ to B'O’. Then 


h AB 
Er 35 ABA 
A 
A 
B D C B D C 
Figure 12.14 


Proof. Let AD and A'D be the altitudes whose lengths are h and h’. If D = B, 
then D’ = B', and there is nothing to prove. If not, AABD ~ AA'B'D', and the 
theorem follows. 


ain FO 


POLYGONAL REGIONS AND 
THEIR AREAS 
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13.1 THE AREA POSTULATES 


A triangular region is a figure which is the union of a triangle and its interior 
like this: 


Figure 13.1 


The sides of the triangle are called edges of the region, and the vertices of the tri- 
angle are called vertices of the region. 
A polygonal region is a figure like one of these: 


Fieure 13.2 


To be exact, a polygonal region is a plane figure which can be expressed as the 
union of a finite number of triangular regions, in such a way that if two of the 
triangular regions intersect, their intersection is an edge or a vertex of each of them. 
In each of the illustrations in Fig. 13.2, the dotted lines indicate how the regions 
can be cut up into triangular regions in such a way that the conditions of the 
definition are satisfied. Of course, there is nothing unique about the way in which 
a polygonal region can be cut up into triangular regions. In fact, if such a process 
can be carried out at all for a particular figure, it can be done in infinitely many 
153 
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ways. For example, a parallelogram plus its interior can be cut up in at least this 


many ways: 


Fieure 13.3 


The theory of area is easiest to handle for the case where the figures that we are 
dealing with are polygonal regions. And the easiest way to set up the theory is 
to suppose that an area function is given, under which to each polygonal region 
there corresponds a positive number called its area. Thus we let ® be the set of 
all polygonal regions, and we add to our structure the function 


a:Q >R. 
Thus the total structure in our geometry is now 
[S, £, ®, d, m, a], 


and we need to state the postulates governing the area function a. (Here we are 
using the Greek letter alpha because the natural English letters have by now been 
used up for other purposes. We have used m for measure, and we want to go on 
using A and a to say, for example, that a is the length of the side opposite A in 
AABC.) 

Our postulates are as follows. 


A-1. a is a function ® — R, where & is the set of all polygonal regions and R is 
the set of all real numbers. 


A-2. For every polygonal region R, aR > 0. 


A-3. The Congruence Postulate. If two triangular regions are congruent, then 
they have the same area. 


A-4. The Additivity Postulate. If two polygonal regions intersect only in edges 


and vertices (or do not intersect at all), then the area of their union is the sum 
of their areas. 


Ficure 13.4 
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For example, a(R; U R2) = aR, + «Ra. Of course, areas cannot be added in 
this way if the intersection of the two regions contains à triangular region. Here 


a(Tı U T3) is obviously less than aT + aT. 


uS. Lr 


Fieure 13.5 


ion that satisfies Postulates A-1 through A-4, 


and we then agree to multiply all areas by 2, then we get another area function that 
also satisfies A-1 through A-4. Intuitively speaking, area measured in square 
inches satisfies all our postulates so far, and so also does area measured in square 
cubits. We therefore need a postulate which, in effect, chooses a unit of measure 
for us by describing à connection between area and distance. 

By a square region we mean the union of a square and its interior. Rectangular 
regions are defined in the same way. One way to fix the unit of measure is to take 
the following statement as & postulate: The area of a square with edge 1 is equal 


to 1. 1 


Note that if we have an area functi 


aR=a! 


1 
Ficure 13.6 


This would lead, however, to complications of a sort that we want to avoid in 
the present chapter. The difficulty is that when we try to prove that the usual 
we get involved in a proof which has a good bit 


area formula holds for rectangles, 
e basic similarity theorem. (See Section 13.6.) 


in common with the proof of th 
For the moment, we avoid this by using the area formula for rectangles as a postu- 


late. 


A-5. The Unit Postulate. The area 
base and its altitude. 


of a rectangular region is the product of its 


h R 
aR=bh 
FicunE 13.7 


Later, we shall see that we can get along without any of the Postulates A-1 
through A-5. It can be proved that there is an area function satisfying all of them. 
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In the meantime, however, we shall take the area function as given, subject to 
A-1 through A-5, and show how it can be put to work on various geometric problems. 
The first step, of course, is to get formulas for the areas of the simplest figures. 

Hereafter, for the sake of convenience, we shall speak of the areas of triangles, 
areas of rectangles, and so on; this is an abbreviation. Triangles and quadrilaterals 
are not polygonal regions, and obviously they are much too thin to have areas 
greater than zero. Also we shall abbreviate aA ABC as ABC, of JABCD as ABCD, 
and so on. This is consistent with our notation AB for the length of the segment 
AB. In each case, when the letters appear without decoration, the resulting 
expression denotes a number, and this number Measures, in some way, the cor- 


responding geometric figure. To repeat: 


ABC = aAABC, 
and 
ABCD = of ]ABCD, 
by definition. 


13.2 AREA THEOREMS FOR TRIANGLES AND QUADRILATERALS 


Theorem 1. The area of a right triangle is half the product of the lengths of 
its legs. 


A 


Ga re a 


ABC=}ab 
Fieure 13.8 


Proof. Given AABC, with a right angle at C. Let D be the point such that 
L]ADBC is a rectangle. By the additivity postulate, 


ADBC = ABC + ABD. 


By the congruence postulate, 


ABD = ABC. 
Therefore 
: ADBC — 2ABC. 
By the unit postulate, 
ADBC = ab. 
Therefore 2ABC = ab, and 
. ABC = tab, 


which was to be proved, 
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Theorem 2. The area of a triangle is half the product of any base and the cor- 
responding altitude. 


B 
B 


A=D e 54 D C 
Figure 13.9 


Proof. Given AABC. Let D be the foot of the perpendicular from B to AG; 
let AC = b, and let BD = h (as in each of the figures). There are, essentially, 
three cases to consider. 
(1) If A = D, then A ABC is a right triangle and 
ABC = 4bh, 
by Theorem 1. 
(2) A-D-C. Let AD = bi and DC = bz. By Theorem di, 


BDA=4byh and BDC = 4boh. 


By the additivity postulate, 
ABC = BDA + BDC. 


Therefore : 
ABC = ih + Roh = (di + b3)h 


= bh, 
which was to be proved. 
(3) D-A-C. Let v = AD. By Theorem 1, 
BDC = À(' + b)h. 


Also by Theorem 1, 
BDA = 3)'h. 


By the additivity postulate, 
BDC = BDA + ABC. 
Therefore 
ABC = BDC — BDA 
= 4b! + b)h — $b'h 
= ġbh, 
which was to be proved. 


Theorem 3. The area of a parallelogram is the product of any base and the 


corresponding altitude. 
ABCD = bh. 
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B F Cc 


A E b D 
ABCD — bh 
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Proof. Given a parallelogram []A BCD, with base b = AD and corresponding 
altitude h = BE. By the additivity postulate, 


y ABCD = ABD + BDC. 
By two applications of Theorem 2, 

ABD = %bh and BDC = 3th. 
(Details? We need to know that BC = b and DF = h.) Therefore 


ABCD = bh + 4bh = bh, 
which was to be proved. 
This is not the order of derivations that we most often see; usually we get the 


area formula for parallelograms first, and derive Theorem 2 from it. The “proof” 
of Theorem 3 then looks like this: 


h 


[^ 

1 

| 

I 

| 

| 

| 

a 

A E D F 
Fieure 13.11 


(1) ABCD = ABE + BCDE. 

(2) ABE = DOF, because AABE £& ADCF. 

(3) ABCD = BCDE + DCF. 

(4) BCDE + DCF = BCFE, by the additivity postulate. 

(5) BCFE = bh, by the unit postulate, because BCFE is a rectangle. 

This “proof” works only in the cases described by the figures that are drawn to 
illustrate it. Consider the following case. If the parallelogram looks like Fig. 
13.12, then the above discussion becomes nonsense in the very first step, because 
there is no such thing as “the quadrilateral []BCDE,” and even if we allowed 
quadrilaterals to cross themselves, the equation ABCD = ABE + BCDE would 
not hold for the areas of the corresponding polygonal regions. 
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ee ee. E 
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Theorem 4. The area of a trapezoid is half the product of the altitude and the 
sum of the bases. 


B b, e 


A b, D 
ABCD - by bh 


Ficure 13.13 


Proof. The main steps are as follows: 


(1) ABCD = ABD + BDC, 
(2) ABD = 3h, 

(3) BDC = $bih, 

(4) ABCD = 4(b1 + ba)h. 


Note that Theorem 4 includes Theorem 3 as a special case, because every 
parallelogram is à trapezoid. Note also that the proof of Theorem 4 is exactly the 
same as that of Theorem 3. The point is that, although the invalid proof of Theorem 
3 uses and needs the fact that AB || CD; the valid proof does not. 


Theorem 5. If two triangles have the same altitude, then the ratio of their areas 
is equal to the ratio of their bases. 


This theorem follows immediately from the area formula. If the triangles 
AABC and ADEF have bases bı, b2, and the corresponding altitude for each 
of them is h, then 

ABC duh ba 
DEF A br 


which was to be proved. In much the same way, we get the following theorem. 


Theorem 6. If two triangles have the same base, then the ratio of their areas is 
the ratio of their corresponding altitudes. 
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The following theorem is a corollary of each of the preceding theorems. 


Theorem 7. If two triangles have the same base and the same altitude, then 
they have the same area. 


b b 
Figure 13.14 


Theorem 8. If two triangles are similar, then the ratio of their areas is the square 
of the ratio of any two corresponding sides. That is, if 


AABC ~ ADEF, 


8- 0-0-0 


$ ; 
B 

ASE BN 

"au cL OF: "DEBE 


Fieure 13.15 


then, 


Proof. If the altitudes to AC and DF are and h’, as in the figure above, then 
we know by Theorem 6, Section 12.3, that 


gaa eal 
ng 
Now 
ABC Wh _ (b\(h 
DEF ie We d 


which was proved. 


13.3 APPLICATIONS OF AREA THEORY: A SIMPLE PROOF OF THE BASIC 
SIMILARITY THEOREM 


The theory of plane area is not merely an adjunct to the geometry on which 
it is based. If we use the area postulates A-1 through A-5 as part of our basic 
apparatus, then we can simplify some of our proofs quite considerably. Probably 
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the most striking simplification of this kind is in the proof of the basic similarity 
theorem (Theorem 1, Section 11.4). 

Consider first the case where the two transversals T' and 7" intersect in a point 
of L,. (It is quite easy, as we shall see, to pass from this to the general case.) 
The picture then looks like this: 


A 


Ficure 13.16 
We have given BB’ || CC’, and we want to prove that 


AB _ AB’, 
Bc XB 


The steps in the area proof are as follows. 
(1) ACBB' and AC' BB’ have the same base b = BB’, and the same correspond- 


ing altitude h. Therefore " 
CBB' — C'BB'. 


(2) Let us look at AABB' and ACBB’ sidewise, taking AB and BC as their 
bases. Then the corresponding altitudes are the same. In each case, the altitude 
is the length h’ of the perpendicular from B' to AC. Therefore 


ABB' AB 
CBB ^ BC' 
by Theorem 5, Section 132. 
(3) For exactly the same sort of reason, 


ABB’ _ AB'. 
CBB BC’ 


[To get this, merely replace B by B', and C by C’, in step (2).] 
(4) Therefore 
AB  ABB' ABB’ _ AB' 
p- BC = CBB’ CBB BC’ 


X which was to be proved. 
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Consider now the general case. Let 7"' be the line through A, parallel to 7", 
intersecting Lz and L3 in B" and C" Then 
AB | AB" 


Box BIG 
by the preceding proof. But we have 
ABCs A^B^ 
B' GU = BIC", 
because opposite sides of a parallelogram are congruent. Therefore 


AB | A'B! 
BO Bo 
which was to be proved. 


Figure 13.17 


This proof of the basic similarity theorem is, in a way, inelegant because it 
creates the impression that the theorem depends on the theory of area. As we have 
seen, this impression is false. The area proof has, however, an important virtue H 
that is, it makes the theorem a part of elementary geometry. 

The area proof, incidentally, is the one given in Euclid. This fact, and the 
proof itself, are not as widely known as they deserve to be. 


13.4 FURTHER APPLICATIONS OF AREA THEORY: THE PYTHAGOREAN THEOREM 


The Pythagorean theorem can be proved without using the concept of a simi- 
larity. Such a proof goes like this. Given a right triangle AABC, with legs of 
length a, b, and hypotenuse, c. 

Take a square [ |DEFG of edge a + b. In the square, construct four congruent 
copies of A ABC, as shown in the figure. (We construct them in the corners of 
the big square, using SAS.) Then ZKHI is a right angle, because ZDHK and 
ZEHI are complementary. For the same reason, all the angles of LJHIJK are 
right angles, and [ ]H /JK is a square. 
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Fieure 13.18 


By the additivity postulate for area, the area is equal to the area of the little 
square, plus the areas of the four right triangles. Since the right triangles are all 
congruent, we have 

DEFG = HIJK + 4- KDH 


or 
(a + b)? = c? + 4- hab 
or 
a? + 2ab + b? = c? + 2ab. 
Therefore 


a? 4 b? PS c? 
which was to be proved. | 
Euclid's proof also used areas, but it was different from the one above and 
considerably more complicated. Given a right triangle, he constructed squares 


on each of the sides, like this: 
I 


Fiure 13.19 
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Here AABC is a right triangle, with its right angle at C. The outer figures are 
squares, and CK is perpendicular to AB and FG, intersecting then at D and K. 
The main steps in Euclid’s proof were as follows. 

(1) ADKF = 2CAF. (The rectangle and the triangle have the same base AF, 
and the same altitude AD.) 

(2) EHCA = 2EAB. (The square and the triangle have the same base EA, 
and the same altitude AC.) 

(3) ZEAC-= ZFAD, because both are right angles, and ZCAD = ZCAD. 
Therefore LEAB = ZCAF. But EA = AC, because [ ]JEHCA is a square; 
and AB = AF, because [ ]ABGF is a square. By SAS, we have 


AEAB = ACAF. 
Therefore 
EAB = CAF. 


(4) From (1), (2), and (3), we get 
ADKF = EHCA. 


That is, the square at the upper left has the same area as the rectangle at the lower left. 
By exactly the same reasoning, we get the same conclusion for the square and 
rectangle on the right: 

KDBG = BCIJ. 


That is, the square at the upper right has the same area as the rectangle at the lower right. 

(5) Since the area of the lower square []ABGF is the sum of the areas of the 
two rectangles []A DKF and [ ]K DBG, it follows that the area of the square on 
the hypotenuse is equal to the sum of the areas of the squares on the legs. 

This proof is quite different in spirit, and in an important way, more elegant 
than the two that we have seen already. One way of putting it is that while our 
first two proofs depended on calculations, Euclid’s reasoning is more geometric 
and more conceptual. The figure that goes with it is more than a reminder of the 
hypothesis and the notation; it is, in a sense, a picture of the Pythagorean phe- 
nomenon, so that if you understand the figure, you understand why the theorem 
is true. 

In a way, the above presentation of Euclid's proof is misleading. It suggests 
that the Pythagorean theorem meant the same thing to Euclid, in the Elements, 
that it meant to us in Chapter 12 of this book; and this is far from being true. 
Theorem 2, Section 12.3, stated that under certain conditions, the numbers a, b, 
c must satisfy the equation a? + b? = c?: and the theorem appeared in a presenta- 
tion of geometry in which the real numbers are given, independently of geometric 
concepts, and are used in the geometry to measure things. Thus we use the number 
AB as the measure of the length of the segment 4B; we use the number ABC as 
the measure of the area of the triangle A ABC; and so on. 

In Euclid, there are no numbers independent of geometrie concepts, except, of 
course, for the positive integers, which are used to count things. To recast his 
theory, in a form meeting modern standards of explicitness and exactitude, is a 
rather formidable task. We have given a sample of this in Chapter 8. There we 
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4 1 
Ficure 13.20 3 Figure 13.21 


showed that instead of using the concept of length for segments (this length being 
a real number) you can use the concept of same-length: two segments have the same 
length if they are congruent, and congruence between segments is an undefined 
term, subject to certain postulates. Much the same thing can be done for the theory 
of area, but it is technically very difficult. It is therefore customary, especially in 
elementary courses, to use the far simpler apparatus of metric geometry. Often 
this is done unobtrusively, and it is not easy to see what is going on. A moment’s 
reflection, however, will convince us that any time you label a figure as shown in the 
illustrations above, you are doing metric geometry, whether or not you choose to 


say so. 


13.5 A WEAKER FORM OF THE UNIT POSTULATE A-5 


Our fifth postulate for area asserted that the area of a rectangle (that is, of a 
rectangular region) was the product of the base and the altitude. We remarked that 
the following postulate would have been sufficient. 

A-5'. If a square has edges of length 1, then its area is 1. 

This postulate gives us, in a minimal sense, a “unit of measure.” We shall 
show that it implies A-5. 

Theorem 1. If a square has edges of length 1/g (g an integer), then its area 

is 1/g?. 

Proof. A unit square region can be decomposed into q? square regions, all with 
the same edge 1/q and with the same area A (Fig. 13.22). Therefore 1 — A, 


and A = 1/q*. 
1 
a{ 


anf 


Ficure 13.22 
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Theorem 2. If a square has edges of rational length p/q, then its area is p?/q?. 
Proof. Such a square can be decomposed into p? squares, each of edge 1/g. 


al 


"m 


i Fieure 13.23 
If A is its area, then 
ac l1; p* 
=p wins 


Theorem 3. If a square has edges of length a, then its area is a?. 


Proof. Given a square S, with edges of length a. Given any rational number 
p/q, let Spjq be a square of edge p/q, with an angle in common with Sa, like 
this: 


Figure 13.24 


Each of the following statements is easily seen to be equivalent to the next. 
(1) p/q <a 

(2) 85,4 lies in Sa (with two rectangular regions and a square region left, over.) 
(3) Spia < aSa. 

(4) p?/q? < aSa. 

(5) p/a < Vasa. 


Since (1) and (5) are equivalent, we have 
a= aSa, 


so that a? = aSa, which was to be proved. 
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h 

b A |b 
b h 


Figure 13.25 


We can now prove A-5. Given a rectangle of base b and altitude h, we construct 
a square of edge b + h, and decompose it into squares and rectangles as shown 
in the figure above. Then 


(b +h)? = 2A + 41 + As; 
b? + 2bh + h? = 2A +h? +b, 
2bh = 2A, 


and A = bh, as desired. 
The above treatment is due to Peter Lawes, a student of the author. It is much 


simpler than the author’s first version of the proof. 


im 4 


THE CONSTRUCTION OF AN 
AREA FUNCTION 


14.1 THE PROBLEM 


In the preceding chapter we assumed that an area function was given, and that 
it satisfied the Postulates A-1 through A-5. This was a quite reasonable proceed- 
ing; in fact, in elementary geometry, it is the only approach that is simple enough 
to be manageable. 

It is natural, however, to ask whether this fairly complicated set of assumptions 
was merely a matter of convenience or actually a logical necessity. The question 
is whether on the basis of our other postulates, we can define an area function for 
polygonal regions and prove that our function satisfies the conditions A-1 through 
A-6. The answer is that we can. 

In setting up such a function, it is clear that we shall have to begin by assign- 
ing an area to some sort of figure. It seems pretty hopeless to try to do everything 
at once, by assigning an area to every polygonal region at the very start. We ought 
to begin by defining areas for certain simple figures, and try to handle the more 
complicated figures in terms of the simple ones. For this purpose, rectangles are 
not promising because they cannot be used as building blocks, except for very 
special figures. For example, a triangular region cannot be expressed as the union 
of a finite number of rectangular regions. It is true that we got the formula 4bh, 
for the area of a right triangle, from the formula bh for the area of a rectangle. 
But to do this, we had to suppose that there was such a thing as the area of a right 
triangle; and in our present program, the latter question is the whole point at 
issue. 

This suggests that our “point of entry” should be not rectangles, but triangles. 
We begin building our area function by stating, as a definition, that for any AABC, 


the area is ABC = 4bh, 


where b is any base and h is the corresponding altitude. This makes sense, because 
we have shown (Theorem 5, Section 12.3) that the product bh depends only on 
the triangle, and does not depend on the choice of the base. 

We now want to use triangles as building blocks. Given a polygonal region, we 
cut it up into a finite number of triangular regions intersecting only in edges and 
vertices (Fig. 14.1). For each of the triangles, the area is already defined, by the 
formula 35h. If our theory is going to work, then we ought to be able to get the area 
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Figure 14.1 


of the whole region by adding up the formula areas of the triangles. This suggests 
a tempting procedure: We might solve our problem at once by defining the area 
of the region as the sum of the formula areas of the triangles. 

There is, however, a difficulty. Any polygonal region can be cut up into triangular 
regions in infinitely many ways: 


Figure 14.2 


Therefore, before we can define the area of a polygonal region as the sum of the 
formula areas of the little triangular regions, we need to prove that this sum 
depends only on the region that we started with, and is independent of the way in which 
we cut it up. 

At first glance, this problem may seem trivial, and after a little reflection it may 
seem almost impossible. The truth is somewhere in between, as we shall see. 


14.2 COMPLEXES AND THEIR FORMULA AREAS 
Given a polygonal region R, expressed as the union of a finite number of tri- 


angular regions, intersecting only in edges and vertices. The set K whose elements 
are the triangular regions is called a complex, and is called a triangulation of R: 


Figure 14.3 
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In the figure, the complex K is the set 
(T, T», Ts, Ta, Ts}. 


Thus R and K are objects of quite different kinds. R is an infinite set of points, 
and K is a finite set of triangular regions. The vertices and edges of the T's are 
also called vertices and edges of K. 

We are going to define the area of a triangle by the formula bh. To avoid 
confusion between the area function that we shall finally set up and the apparatus 
that we are using in the process, we shall call this the formula area, and state our 
first official definition as follows. 


Dertnition 1. The formula area of a triangle is half the product of any base 
and the corresponding altitude. 


Derinition 2. The formula area of a complex is the sum of the formula areas 
of its clements. 


A strip complex is a complex that looks like either of the figures below: 


NNI ZN 


FiaunE 14.4 Fieure 14.5 


More precisely, a complex K is a strip complex (1) if K is a triangulation of a trape- 
zoid or'a triangle, (2) for the trapezoidal case, all vertices of K are on the upper 
or lower base, and (3) for the triangular case, all vertices of K are on the base or 
the opposite vertex. 

Note that these cases are being handled together, as if the triangle were a 


“trapezoid with upper base = 0,” The following theorem should also be inter- 
preted in this way. 


Theorem 1. The formula area of a strip complex is $(b; + b2)h, where bi 
and be are the bases and h is the altitude. 


e e e 


f hh h ts Is 
FicvnE 14.6 


COMPLEXES AND THEIR FORMULA AREAS 171 


Proof. The sum of the formula areas of the triangles with two vertices on the 
upper base is 
Zeyh + Seah + +--+ + desh, 


where e1, €2,..+, €, are the lengths of the segments into which the upper base is 
divided by the vertices. The sum of the formula areas of the other triangles is 


Mh + Mh + +++ + fuh. 


Therefore the formula area of the complex is 


3A(e + eo +++ + en) + Ma + fo +++: + Sm) = thbi + Mb; 
= $(bi + ba)h, 
which was to be proved. 
It will be convenient to generalize this result slightly. By a strip decomposition 
of a triangle or trapezoid, we mean a decomposition into triangles and trapezoids, 


like this: 


Figure 14.7 


The formula area of a trapezoid is defined to be $(b; + b2)h, where h is the altitude 
and b, and bz are the bases. The triangles and trapezoids in a strip decomposition 
are called its parts. 


Theorem 2. For any strip decomposition of a triangle or trapezoid, the formula 
area of the original figure is the sum of the formula areas of the parts. 


Proof. If none of the parts are trapezoids, this follows immediately from Theorem 
1. If some trapezoids appear, we split each of them into two triangles by putting 


in a diagonal: 


Figure 14.8 


This does not change the sum of the formula areas of the parts. Therefore our 
theorem follows from Theorem 1. 


Theorem 3. If A-D-C, A-E-B, and DE || BC, then the formula area of AABC 
is the sum of the formula areas of AADE and [ ]DEBC. 
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A 
c ™ B 
Ficure 14.9 


Proof. Let b; = BC and bz = DE; let hy and hz be the altitudes of [ ]DEBC 
and AADE. Our theorem then says that 


_ dogha + (bı + b2)hi = 3610 + he) 


or 
bohe + byhi + bah, = bihı + biho 
or 
balhi + ho) = biho 
or 
hi +h bi 
he be 


This is true, by Theorem 6, Section 12.3, because AADE ~ AACB. 

By a parallel decomposition of a triangular region, we mean a decomposition 
into one or two triangles and a finite number of trapezoids, like either of the 
following figures. (Let us spare ourselves the formal definition.) 


A 
B D 
A 
c A Cc 
(a) (b) 


Figure 14.10 


Theorem 4. For any parallel decomposition of a triangle, the formula area of 


the original triangle is the sum of the formula areas of the trapezoids and the 
triangles in the decomposition. 
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Proof. For Case 1 (Fig. 14.10b) this follows by repeated applications of the 
preceding theorem, working from the top downward. For Case 2, [Fig. 14.10(a)] 
we observe, first, that by Theorem 1, the formula area of A ABC is the sum of the 
formula areas of ABDC and AABD. (To apply Theorem 1, we must look at the 
figure sidewise. Now apply the result of Case 1 to each of these. We are now 
ready to prove our main theorem. 


Theorem 5. All triangulations of the same polygonal region have the same 
formula area. 


Proof. Given two triangulations K, and K3 of a polygonal region R. 


Figure 14.11 


In the figure, the edges of K, are solid, and those of Ky are dashed. 

We take a family of parallel lines Lı, Lo, ..., Ln passing through all vertices 
of K;, all vertices of Ke, and all points where edges of K, intersect edges of K3. 
(These are the horizontal lines, formed with long dashes, in the figure.) Now the 
lines L; give parallel decompositions of each triangle of Ky: 


Figure 14.12 


Let us call these triangles and trapezoids the primary parts of Kı. By definition, 
the formula area of K, is the sum of the formula areas of these triangles. We 
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apply Theorem 4 to each triangle, and add the results. This gives: 

(1) The formula area of K, is the sum of the formula areas of the primary parts 
of K 1: 

The edges of Ke give a strip decomposition of each primary part of Ky: 


Ficure 14.13 


Let us call these smaller triangles and trapezoids the secondary parts of Kı. (In 
Fig. 14.13, there are a total of eight secondary parts, in the triangle shown. In 
Fig. 14.11 at the beginning of the proof of Theorem 5, there are a grand total of 
31 secondary parts of Kı.) 

We know, by Theorem 2, that: 

(2) The formula area of each primary part of Kı is the sum of the formula areas 
of the secondary parts that lie in it. 

Combining (1) and (2), we get: 

(3) The formula area of K, is the sum of the formula areas of the secondary 
parts of Kı. 

Applying the same reasoning to Ko, we get: 

(4) The formula area of K3 is the sum of the formula areas of the secondary 
parts of Ko. 

This tells us all that we need to know, because the secondary parts of K3 are 
exactly the same as the secondary parts of Kı. Therefore, by (3) and (4), Kı 
and Ko have the same formula area, which was to be proved. Thus, we can state 
the following definition. 


Derinition. The area aR of a polygonal region R is the num»c: hich is the 
formula area of every triangulation of R. 


14.3 VERIFICATION OF THE AREA POSTULATES FOR THE FUNCTION o 


Trivially, we know that 
A-1. ais a function R — R. 
A-2. aR > 0 for every R. 


A-3. Any two congruent triangles have the same area. 
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These follow immediately from the definition of o. 
A-5. The area of a rectangle is the product of its base and its altitude. 


The reason is that either diagonal divides a rectangle into two right triangles, 
each of which has formula area —3bA. (This is exactly like the proof of Theorem 1, 
Section 12.1, only now it works in reverse.) 

We proceed to verify A-4. Given regions R, and Re, with triangulations Ky 
and Ks. Suppose that P, and Rz intersect only in edges and vertices. It may 


Fieure 14.14 


happen, as indicated in the figure, that some edges of K, (or Ko) contain more 
than one edge of Kə (or Kı). If so, we split some triangles in K, (or. Kə) into 
smaller triangles. 


Figure 14.15 


In this way we get new triangulations Kj, K 4 whose union K is a triangulation of 
R,U Ry. Now a(R, U Re) = aR; + aha, because the formula area of K is 
the sum of the formula areas of K, and Kə. Thus A-4 is satisfied by a. 
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PERPENDICULAR LINES AND PLANES 
IN SPACE 


15.1 THE BASIC THEOREMS 


Given a line L and a plane Æ, intersecting in a point P. 
If every line in E, passing through P, is perpendicular to L, then we say that L 
and E are perpendicular, and we write L L E, or E L L. In the figure, L is supposed 


Figure 15.1 


to be perpendicular to Æ. We have indicated two lines in E, passing through P. 
These are both perpendicular to L, although, in a perspective drawing, they don't 
look as though they are. Note that when we say that L 1 E, we are making a 
statement about an infinite collection of lines; that is, all of the lines that lie in 
E and contain P must be perpendicular to E. If we required merely that E contain 
one line perpendicular to L, this wouldn't mean a thing. You can fairly easily 
convince yourself that every plane that intersects L contains such a line. Soon 
we shall prove that if E contains two lines perpendicular to L, then E .L L. The 
following two theorems are preliminaries. 
A point A is equidistant from two points P and Q if AP = AQ (Fig. 15.2). 


Theorem 1. If A and B are equidistant 

from P and Q, then every point between 

A and B has the same property. 

The main steps in the proof are as follows. A È 
(1) APAB = AQAB. AN 

(2) ZPAB = ZQAB. 


(3) APAX = AQAX. i 
(4) PX — QX. Figure 15.2 
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Theorem 2. If a line L contains the midpoint of 7 PQ, and contains another point 
which is equidistant from P and Q, then L 1 PQ. 


3 P 
L A D 
| Q 
d 
Figure 15.3 


For APAD & AQAD by SSS. Therefore ZPAD = ZQAD, and each is a 
right angle. 


Theorem 3. If a line is perpendicular to each of two intersecting lines at their 
point of intersection, then it is perpendicular to the plane that contains them, 


Restatement. Let Lı and La be two lines intersecting at A, and let E be the 
plane that contains them. Let L be a line which is perpendicular to Lı and La 
at A. Then every line in E through A is perpendicular to L. 


Proof. Let P and Q be points of L such that P-A-Q and AP = AQ. Let Ls 

be any third line in E through A. Now each of the lines Lı and Lz contains points 

| on each side of La in E. Let B and C be points of Ly and Lə, lying on opposite 
| sides of La in E. Then BC intersects La in a point D # A. Now 


(1) APAB = AQAB, by SAS. 
(2) PB — QB. 

(3) Similarly, PC — QC. 

(4) PD — QD, by Theorem 1. 
(5) PO L Ls, by Theorem 2. 


E | 


178 PERPENDICULAR LINES AND PLANES IN SPACE 


The perpendicular bisector of a segment in a plane is the line which is per- 
pendicular to the segment at its midpoint. The following theorems are easy to 
prove and serve as an introduction to the analogous theorems in space. 


Theorem 4. If L is the perpendicular bisector of the segment AB (in a plane E), 
then all points of L are equidistant from A and B. 


The converse is also true. 


Theorem 5. Let A, B, and P be points of a plane E. If P is equidistant from 
A and B, then P lies on the perpendicular bisector of AB. 


Combining these two theorems, we get the following theorem. 


Theorem 6. The perpendicular bisector of a segment in a plane is the set of 
all points of the plane that are equidistant from the end points of the seg- 
ment. 


A theorem of this sort is called a characterization theorem. You have characterized 
a figure—that is, a set of points—if you state a condition that is satisfied by the 
points belonging to the given set, and by no other points. Usually the proof of a 
characterization theorem is in two parts. For example, let L be the perpendicular 
bisector of AB in the plane E and let 


W = (P|P lies in E, and PA = PB}. 


Theorem 4 says that every point of L lies in W; that is, L C W. Theorem 5 
says that every point of W lies in L; that is, W C L. Taken together, these state- 
ments tell us that L = W; and this is precisely the content of Theorem 6. 

Under a widespread usage, theorems like Theorem 6 are called locus theorems; 
people sometimes say that “the perpendicular bisector of a segment is the locus 
of all points of the plane that are equidistant from the end points.” The Latin 
meaning of the word locus is no clue to this usage, because in Latin the word simply 
means place, which doesn’t fit the context at all. The most straightforward way to 
describe the-usage is to say that a set of points is referred to as a locus if the 
speaker is about to give a characterization of the set. For this reason, the word 
locus can be thrown away without ever being missed; it has been superseded by 
the universally applicable word set. 

Pursuing the analogy of perpendicular bisectors in a plane, we may now be 
tempted to say that the perpendicular bisector of a segment (in space) is the plane 
which is perpendicular to the segment at its midpoint. This definition tacitly assumes 
that through the midpoint of a segment there is one and only one perpendicular 
plane; and therefore we—or rather you—should prove these statements, in order 
to legitimize the definition. : 


Theorem 7. Given a line L and a point P of L. There is at least one plane which 
is perpendicular to L at P. 
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Theorem 8. Given a line L and a point P of L. There is only one plane which 

is perpendicular to L at P. 

(Hint: L is the intersection of two planes E, and E;.] We can now prove theorems 
analogous to Theorems 4, 5, and 6. 


Theorem 9. Every point of the perpendicular bisecting plane of a segment is 
equidistant from the end points of the segment. 


FicunE 15.5 


Proof. Let A be the midpoint of PQ, let E be perpendicular to PQ at A, and let 
X be any point of E. If X = A, then X is equidistant from P and Q. If X # A, 
then by SAS we have APAX = AQAX, so that PX = QX, which was to be 


proved. 


Theorem 10. Every point equidistant from the end points of a segment lies in 

the perpendicular bisecting plane of the segment. 

RESTATEMENT. Let M be the midpoint of PQ, let E be the plane perpendicular 

to PQ at M, and let X be a point such that PX = QX. Then X lies in E. 
Proof. By Theorem 2, MX L PQ at M. Let F be the.plane that contains 
PQ : and X. Then F intersects E in a line L, and L L PQ at M. 'Therefore L — 
MX, because perpendicular lines in à plane are unique. Therefore X lies in E, 


which was to be proved. 
Fitting these theorems together, as in the planar discussion, we get a characteriza- 


tion theorem. 


Theorem 11. The perpendicular bisecting plane of a segment is the set of all 
points that are equidistant from the end points of the segment. 


We shall need the following theorem in discussing perpendiculars to planes 
through external points. 
Theorem 12. Any two lines perpendicular to the same plane are coplanar. 


Proof. Let Lı and Lz be perpendicular to the plane E at points A and B, re- 
spectively. Let M be the midpoint of AB, let L be the perpendicular bisector of 
AB in E, and let P and Q be two: points of L, equidistant from M. 
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Ficure 15.6 


We shall show that every point C of Lı is equidistant from P and Q. Now AP = 
AQ, because AB is the perpendicular bisector of PQ. Since Lı L E, it follows that 
ACAP = ACAQ. Therefore CP = CQ. 

In the same way it follows that every point of L3 is equidistant from P and Q. 
Therefore Lı and Lə lie in the same plane, namely, the perpendicular bisecting 
plane of the segment PQ. (See Theorem 11.) 

The rest of the theorems in this section are stated without proof; you should 
be able to furnish the proofs in each case. (But for Theorem 15 you will probably 
need the hints at the end of the section.) 


Theorem 13. Through a given point in a given plane there is at least one line 
perpendicular to the given plane. 


Theorem 14. Through a given point in a given plane there is at most one line 
perpendicular to the given plane. 


Theorem 15. Through a given point not in a given plane there is at least one line 
perpendicular to the given plane. 


Theorem 16. Through a given point not in a given plane there is at most one 
line perpendicular to the given plane. 


The preceding four theorems fit together to give the following theorems. 


Theorem 17. Given a point and a plane, there is exactly one line which passes 
through the given point and is perpendicular to the given plane. 


Theorem 18. If a plane E and line L are perpendicular at a point P, then E 
contains every line that passes through P and is perpendicular to L. 


The main stages in the proof of Theorem 15 are the following. We have given 
a plane Æ and an external point P. 

(1) Let L4 by any line in E. 

(2) Let E; be the plane containing P and Ly: 
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(3) Let Lz be the perpendicular from P to La, intersecting Lı at Q. 
(4) Let L3 be the perpendicular to L, at Q, in E. 
(5) Let Ez be the plane containing P and Ls. 
(6) Let L be the perpendicular from P to Ls, in Ev. 
We then show that L .L E. 
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1. Prove Theorems 7, 8, 13, 14, 15, 16, and 18, above. 

2. Show that if a line L contains two points equidistant from P and Q, then every point 
of L is equidistant from P and Q. 

3. Show that if a plane E contains three noncollinear points which are equidistant from 
P and Q, then all points of E are equidistant from P and Q. 


15.2 PARALLEL LINES AND PLANES IN SPACE 


Two planes are called parallel if they have no point in common. If the planes 
E, and E are parallel, then we write E, || Es. Similarly, a line L and a plane E 
are parallel if they have no point in common; we abbreviate this by writing L || E, 
or E || L. 

The theory of parallelism in space is closely analogous to that of parallelism in 
a plane. No new postulates are needed, and the proofs are easy, as we shall see. 


Theorem 1. If a plane intersects two parallel planes, then it intersects them in 
two parallel lines. 


Figure 15.7 


Proof. We have given a plane Æ, intersecting two parallel planes E, and E; 
in the nonempty sets Lı and Lz. By Postulate I-4, the sets Ly and L3 are lines. 
They are coplanar, because E contains both of them, and they have no point in 
common, because L; lies in E; and Lz lies in E2. Therefore Lı and L are parallel 
lines, which was to be proved. 


Theorem 2. Two lines perpendicular to the same plane are parallel. 
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Proof. Given Lı and Lə, perpendicular to a plane E at points P and Q. By 
Theorem 12, Section 15.1, Lı and Lz are coplanar. And each of them is perpendic- 
ular to the line PQ, because PQ lies in E. Therefore Lı || Lo. (Why?) 


Theorem 3. If a line is perpendicular to one of two parallel planes, it is perpen- 


dicular to the other. 
L 
^ iQ H 
'P, 


l 
| 
E 


Figure 15.9 


Proof. Let E, and E5 be the two parallel planes, and let L be a line perpendicular to 
E, at Pı. Thus we have given that every line in E; that contains P, is perpendic- 
ular to L. We need to prove two things: 


(1) L intersects E» (in a point P3). 
(2) Every line in E? that contains P? is perpendicular to L. 


Proof of (1). Let Qz be any point of Es, and let QiQ2 be the perpendicular 
segment from Q» to a point Q, of Hy. If Qı = P, there is nothing to prove, be- 
cause, in this case, Q1Q» = L, by Theorem 14, Section 15.1, and L therefore 
intersects E». 

Suppose, then, that Qı = P;. By the preceding theorem, it follows that L || 
QiQz. If it were also true that L || La, then it would follow that Lo || QiQz, which 
is false. Therefore L and Lə are not parallel; L and Lz intersect, at a point Po. 
Then P; lies in both L and Ez, which was to be proved. 
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Fieure 15.10 


Proof of (2). We now have given that L intersects E» at Po. Let L3 be any line 
in Es, passing through P5. We need to prove that Ly L L. 

Let E be the plane that contains L and Ls. Then Æ intersects E; in a line Ly. 
By Theorem 1, Lı || L2, and Lı L L, because every line in E, through P, is per- 
pendieular to L. Therefore Lg -L L, which was to be proved. 


Theorem 4. Any two planes perpendicular to the same line are parallel. 


Proof. Suppose that L is perpendicular to E; at P and perpendicular to E2 
atQ. If E, intersects E; at a point R, then APQR has right angles at both P and 
Q, which is impossible. 

PnopruEM Ser 15.2 

The following theorems have rather short proofs. Prove them. 

Theorem 5. A plane perpendicular to one of two parallel lines is perpendicular to the 

other. 

Remember that you must begin by showing that the given plane intersects the second 
of the given lines. 

Theorem 6. If two lines are each parallel to a third line, they are parallel to each 

other. 


Theorem 7. Two parallel planes are everywhere equidistant. That is, all perpendicular 
segments from one of the two planes to the other are congruent. 


Theorem 8. Let H bea half space with face E. Let e be 
a positive number. Let F be the set of all points Q of H | ee vt f 
whose perpendicular distance from E is =e. Then F 
is a plane. š 
[Hint: For the proof, let P be a point of E, and let AP 
be a segment perpendicular to E, such that AP = e. E 
Let E’ be the plane through A, perpendicular to AP. P 
Show that (1) F C E', and (2) E' CF. It will follow 
that F is a plane, namely, the plane E'] Ficure 15.11 
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We recall that a dihedral angle is the union of a line and two noncoplanar half 
planes having the line as their common edge. The line is called the edge of the 
dihedral angle, and the two half planes are called its sides or faces. 


Figure 15.12 


If PQi is the edge, and A and B are points on different sides, then the dihedral 
angle is denoted by ZA-PQ-B. It is clear, of course, that the dihedral angle i is 
completely determined when P, Q, A, and B are named. 

It is not hard to see that if E is a plane which intersects the edge PQ in only 
one point C, then E intersects Z A-PQ-B in an angle: 


Figure 15.13 


In the figure, E is the plane CDF, intersecting the dihedral angle Z A-PQ-B in 
ZDCF. The size of ZDCF depends, of course, on the position of the plane E. 
In the figure, ZDCF is “fairly large"; but if ZPCD and ZPCF are both very 
“small,” then Z DCF will also be “very small." 

If through a point C of the edge we pass a plane E, perpendicular to the edge, 
then the intersection of E with the dihedral angle is called a plane angle of the 
dihedral angle. We shall use the plane angles to define a degree-measure for dihedral 
angles. To do this, we need the following theorem. 


Theorem 1. Any two plane angles of the same dihedral angle are congruent. 


Proof. Let ZC and <D be two plane angles of the given dihedral angle. Take 
points P, Q, R, S on the sides of ZC and ZD, as in the figure, so that CP = DQ 
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Figure 15.14 


and CR = DS. (When we say “as in the figure,” this means that P and Q are 
on one side of the dihedral angle; R and S are on the other side of the dihedral 
angle.) 

The main steps in the proof are as follows. 

(1) CP || DQ. (These segments are coplanar, and perpendicular to the same 
line.) 

(2) CICPQD is a parallelogram. (A pair of opposite sides are congruent and 
parallel.) 

(3) PQ = CD. (Why?) 

(4) PQ || CD. (Why?) 

q^ CR || DS. 

(2^) [JC RSD is a parallelogram. 

(3) RS = CD. 

(4^) RS || CD. 

(5) PQ = RS (by (3) and (3^). 

(6) PQ || RS (by (4) and (4). 

(7) PR = QS. (Why?) 

(8) APCR = AQDS (by S88). 

(9) ZPCR = ZQDS, which was to be proved. 

We can now define the measure of a dihedral angle. The measure mZA-PQ-B 


is the number which is the measure of all plane angles of ZA-PQ-B. 

The same theorem enables us to define right dihedral angles. A dihedral angle is 
a right angle if its plane angles are right angles. Two planes are perpendicular if 
their union contains a right dihedral angle. 
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Prove the following theorems. 

Theorem 2. If a line is perpendicular to a plane, then every plane that contains the 
given line is perpendicular to the given plane. 

Theorem 3. If two planes are perpendicular, then any line in one of them, perpendicular 
to their line of intersection, is perpendicular to the other. 


| 
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CIRCLES AND SPHERES 


16.1 BASIC DEFINITIONS 


Let P be a point of a plane E, and let r be a positive number. The circle with 
center P and radius r is the set of all points Q of E whose distance from P is equal 
tor. Two or more circles with the same center are called concentric. 


Q 
j ZN 
R 
FicunE 16.1 


If Q is any point of the circle, then the segment PQ is a radius of the circle, 
and Q is called its outer end. If Q and R are any two points of the circle, then 
the segment QR is a chord of the circle. A chord that contains the center is called 
a diameter of the circle. Evidently the length of every diameter is the number 
2r. This number 2r is called the diameter of the circle. (Note that the word radius 
is used in two senses. It may mean either a number r or a segment PQ. But it 
will always be easy to tell which is meant. When we speak of the radius, we mean 
the number r, and when we speak of a radius, we mean a segment. Similarly for 
the two uses of the word diameter.) 

The interior of a circle is the set of all points of the plane whose distance from 
the center is less than the radius. The exterior of a circle is the set of all points of 
the plane whose distance from the center is greater than the radius. 


Exterior 


Ficure 16.2 
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The corresponding definitions for spheres in space are precisely analogous. 
They are as follows. 

Given a point P and a positive number r. The sphere with center P and radius r 
is the set of all points Q whose distance from P is equal to r. Two or more spheres 
with the same center are called concentric. 


Fieure 16.8 


If Q is any point of the sphere, then the segment PQ is a radius of the sphere, 
and Q is called its outer end. If Q and R are any two points of the sphere, then 
the segment QR is called a chord of the sphere. A chord which contains the center 
is called a diameter of the sphere. Evidently the length of every diameter is the 
number 2r. The number 2r is called the diameter of the sphere. 

The interior of a sphere is the set of all points whose distance from the center is 
less than the radius. The exterior of a sphere is the set of all points whose distance 
from the center is greater than the radius. 


. 


Prosiem Ser 16.1 

1. Show that every circle has only one center and only one radius. That is, if the circle 

with center P’ and radius 7’ is the same as the circle with center P and | radius r, then 
P'- Pandr’ = r. [Hint: Suppose that P # P’, and consider the line PP’.] 


16.2 SECANT AND TANGENT LINE. THE LINE-CIRCLE THEOREM 


Given a circle C and a line L in the same plane. If the line and the circle have one 
n the line is called a tangent line, and the common 


and only one point in common, the ; and 
point is called the point of tangency, or point of contact. If the line intersects the 
circle at more than one point, it is called a secant line. The following theorem is 


familiar, and is easy to prove. 


Theorem 1. If a line is perpendicular to a radius of a circle at its outer end, 


then the line is a tangent. 

Proof. Let C be a circle with center at P; let PQ be a radius, and let L be per- 
e to PQ at Q (Fig. 16.4). If R is any other point of L, then PR > PQ, 
because the shortest segment joining à point to a line is the perpendicular segment. 
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Ficure 16.4 


Therefore R is in the exterior of C. Therefore L intersects C only at Q and hence is 
a tangent line. 


The converse is also true. 


Theorem 2. Every tangent to a circle is perpendicular to the radius drawn to 
the point of contact. 


Fieure 16.5 


Proof. Let C be a circle with center at P, and let L be tangent to C at Q. Sup- 
pose that Q is not the foot of the perpendicular from P to L, and let R be the 
point which is the foot of the perpendicular. By the segment-construction postulate 
(or the segment-construction theorem, according to the chapter in which you refer 
to it), there is a point S of L such that Q-R-S and RQ = RS. By the Pythagorean 
theorem, applied twice, we have 

PR? + RS? = PS, | PR? + RQ? = PQ?. 


Therefore PS = PQ, S lies on the circle, and L is not a tangent line. 
The proofs of the following theorems are fairly straightforward. 


Theorem 3. Any perpendicular from the center of C to a chord bisects the chord. 


Theorem 4. The segment joining the center to the midpoint of a chord is per- 
pendicular to the chord. 


Theorem 5. In E, the perpendicular bisector of a chord passes through the center. 
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Circles with the same radius r are called congruent. By the distance between 
the center of a circle and a chord, we mean, of course, the perpendicular distance; 
that is, the length of the perpendicular segment from the center to the chord. 
Two chords are equidistant from the center if their distances from the center are 
the same. 


Theorem 6. In the same circle or in congruent circles, chords equidistant from 
the center are congruent. 


Theorem 7. In the same circle or in congruent circles, any two congruent chords 
are equidistant from the center. 


The following innocent looking theorem is of special interest. 


Theorem 8. The Line-circle Theorem. If a line intersects the interior of a circle, 
then it intersects the circle in exactly two points. 


Ficure 16.6 


Proof. Let C be the circle with center at P and radius r, and let L be the line. 
Let Q be the foot of the perpendicular from P to L. Since PZ < r for some point 
Z of L, it follows that PQ < r; that is, Q lies in the interior. 


ber PQ-—s«rf, 


as indicated in the figure. We want to prove that C intersects L in exactly two 


points, R and S. 
If X is à point where the line intersects the circle, then APQX has a right angle 


at Q. Therefore 


s? + QX? ae r?, 
by the Pythagorean theorem. Hence 
QX = vr? — 8?. 


And conversely, if X lies on L, and QX = vr? — s?, it follows that 
PX? = è + (vr? — 82)” 


=s?+r? — 8? 
2 


=r. 
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Now r? — s? > 0, because s < r. By the Euclidean completeness postulate, 
r? — s? has a positive square root 4/7? — s2. By the ruler, postulate, there are 
exactly two points X of L such that QX = vr? — s?. Therefore exactly two 
points lie both on the line and on the circle. 

This theorem is noteworthy, because to prove it we had to use the Euclidean 
completeness postulate, for the first time. "This fact should not surprise us, be- 
cause the theorem itself describes a completeness property of the plane. If the 
plane “had holes in it," and “some points that ought to be there were missing,” 
then the theorem would fail. f 

? 


/ 
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In the present treatment, this sort of thing is ruled out by the Euclidean com- 
pleteness of the real number system. In a purely synthetic treatment, Theorem 8 
should be taken as a postulate. 


16.3 SECANT AND TANGENT PLANES 


The analogy between the following discussion and the preceding one is fairly 
close. Given a sphere S and a plane E. If the plane and the sphere have one and 
only one point Q in common, then the plane is called a tangent plane, and the com- 
mon point Q is called the point of tangency, or point of contact. If the plane inter- 
sects the sphere in more than one point, it is called a secant plane. 


Theorem 1. Every plane perpendicular to a radius at its outer end is tangent to S. 


Proof. If R is any point of the plane Æ, other than Q, then PR > PQ, because 
the perpendicular segment from P to E is the shortest. 


Figure 16.8 
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Theorem 2. Every tangent to a sphere is perpendicular to the radius drawn to 
the point of contact. ; 


Fieurr 16.9 


Proof. Let E be tangent to Sat Q. Suppose that Q is not the foot of the per- 
pendicular from P to E, and let R be the point which is the foot of the perpendic- 
ular. Let C be the circle with center at R, with radius RQ, in the plane E. IT 
is any point of C, then RT L PR. Therefore, by the Pythagorean theorem, we have 


pT? = PR? + RT? 
= PR? + RQ’. 
But PR? + RQ? = PQ?. Therefore T lies not only in E but also in S. Thus E 
intersects S in an entire circle; and this contradicts the. hypothesis for E. 
Theorem 3. If a plane E intersects the interior of S, then Æ intersects S in a 
circle. 
Proof. Let Q be the foot of the perpendicular from the center P of S to the plane 
E, and let s = PQ. Then s < r. Therefore r? — s? > 0. Let 
t= vr? — 8. 


, again.) It is now straightforward to check that 
precisely the circle in E with center at Q and radius t. 
fairly straightforward, and are omitted. 


(By Euclidean completeness 
the intersection of S and E is 
The proofs of the following theorems are 


Figure 16.10 
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Theorem 4. The perpendicular from the center of S to a secant plane E passes 
through the center of the circle in which Æ intersects S. 


Theorem 5. If the plane E intersects S in a circle C, then the segment between 
the center of S and the center of C is perpendicular to E. 


Theorem 6. If E, S, and C are as in the preceding theorem, and L is a line 
perpendicular to E at the center of C, then L passes through the center of S. 


16.4 ARCS OF CIRCLES 


A central angle of a given circle is an angle whose vertex is the center of the circle. 


A 


Major Minor 
arc arc 


B 
Ficure 16.11 


Let A and B be the points in which the sides of the central angle intersect the circle, 
so that the central angle is Z APB. The minor arc AB is the set consisting of 
A and B together with all points of the circle that are in the interior of Z APB. 
The major arc ÁB is the set consisting of A and B together with all points of the 
circle that lie in the exterior of Z APB. In either case, points A and B are called 
the end points of the arc. 

If A and B are the end points of a diameter, then there are two ares with A 
and B as end points. Each of these ares AB consists of A and B together with all 


points of the circle that lie on a given side of the line. These are called semi- 
circles. 


Of course the notation AB for ares is always ambiguous, because there are 
always two different arcs with A and B as end points. In cases where misunder- 
standing might occur, we remove the ambiguity by taking some third point X 
of the arc, and then denoting the are by AXB. In the figure, ÁXB is a minor 
arc, ÁYB is the corresponding major are, and CAB and CYR are semicircles. 


EI 


Figure 16.12 
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The degree measure mÁXB of an arc AXB is defined in the following way. 


(1) If AXB is a minor are, then mAXB is the measure mZ APB of the cor- 
responding central angle. 


(2) If AXB is a semicircle, then mAXB = 180. 
(3) If AXB is a major are, then mAXB = 360 — mZAPB. 


mAXB=r, mCXB- 180, 
mACB-360—r. 
Figure 16.13 


The following theorem says that degree measure for arcs is additive in the way 
that we might expect. 


Theorem 1. If AB and BC are arcs of the same circle, having only the point B 


=> => = 
in common, and their union is an are AC, then mAB + mBC = mAC. 


To put it briefly, we always have ^ 


mABC = mÁB + mBC. 


The proof is a little tedious, because we need to discuss five cases, but each of 
the five cases is easy. We describe them, give the figures, and leave the verifica- 
tions to the reader. 


CASE 1. ABC is a minor are. 
CASE 2. ABC is a semicircle. 


A 


Ficure 16.14 


In these first two cases, mAB and mBC are simply mZ APB and mZBPC. 
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case 3. ABC isa major arc, and A and C are on opposite sides of the diameter 
that contains B. (What are the equations relating r, u, s, and t?) 


Figure 16.15 


case 4. ABC is a major are, and A and C are on the same side of the diameter 
that contains B. 


Ficure 16.16 


case 5. ABO is a major are, and one of the ares AB, Be is a semicircle. (Here 
mAB = 360 — t = 180 + 180 — t = 180 + s = mAB + mBC.) 


Figure 16.17 


In the figures below, the angle ZABC is inscribed in the dotted are ABC. 
To be exact, an angle is inscribed in an arc of a circle if (1) the two end points of 
the arc lie on the two sides of the angle, and (2) the vertex of the angle is a point, 
but not an end point, of the are. (We can put this more briefly: Z ABC is in- 
scribed in ABC, by definition.) 


Ficure 16.18 
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es r 
~ 


Figure 16.19 


In the figures above, the indicated angle intercepts the dotted arc. In the third 
of these cases, the angle intercepts not only the dotted arc but also the are AXB. 

We shall now give à mathematical definition of the idea conveyed by the figures 
above. An angle intercepts an arc if (1) the end points of the arc lie on the angle, 
(2) each side of the angle contains at least one end point of the are, and (3) except 
for its end points, the arc lies in the interior of the angle. 


Theorem 2. The measure of an inscribed angle is half the measure of its in- 
tercepted arc. 


Restatement. Let ZA be inscribed in an are BAC of a circle, intercepting the 
are BC. ThenmZA = 3mBC. 


Proof. case 1. Consider first the case where ZA contains a diameter of the 
circle. Lel Zo ZABE Ey S ZBPC, and Zz = ZAPB, as in the figure. 


Ficure 16.20 
Then we have 
mZA + mZz + mlz = 180 and  mZz- mZy-— 180. 


Since A and B are on the circle, we have PA — PB. Therefore, by the isosceles 
triangle theorem, we have mZ.A = mZz, so that 


2mZ.A = 180 — mZz 
= mLy 
a 
= mBC. 


Therefore = 
mZA = 3mBC, 


which was to be proved. 
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CASE 2. Suppose that B and C are on opposite sides of the diameter through A. 


Then 
mZA = mZv+mZw. 


B 
Also, by Case 1, 
mZv = 4mBD, 
and A € D 


mlw = 4mD0. 
By Theorem 1, fv Cum 
mBD + mDC = mBC. ^ 
"Therefore 
mZA = kmBC, Ficure 16.21 


which was to be proved. 
CASE 3. Suppose that B and C are on the same side of the diameter through A. 

Here 

mLrı + mLy = mzs, B 
and : cs "S ee c 

mBC + mCD = mBD. Dm 
"Therefore ae D 

mZA = mZy = mZz — mZz 
= 4mBD — 4D, 

by Case 1. Therefore d 
: mZA = imBC, Ficure 16.22 


which was to be proved. 
This theorem has two immediate consequences. 


Theorem 3. An angle inscribed in a semicircle is a right angle. 


Theorem 4. All angles inscribed in the same are are congruent. 


PROBLEM Ser 16.4 


1. Given two circles with a common tangent at a point A such that the second circle 
passes through the center of the first. Show that every chord of the first circle that passes 
through A is bisected by the second circle. 

2. Three or more points are called concyclic if there is a circle that contains all of them. 
Show that every three noncollinear points (in a plane) are concyclic. 

3. Show that three collinear points are never concyclic. 

4. An inscribed quadrilateral is one whose vertices are concyclic. Prove that in an 
inscribed quadrilateral each pair of opposite angles are 
supplementary. 

5. Show, conversely, that if a pair of opposite angles of 
a convex quadrilateral are supplementary, then the quadri- 
lateral is inscribed. 

6. A pair of parallel lines intercept the arc AB of a circle 
if (1) the lines intersect the circle at A and B, and (2) every 
other point of AB lies between the two lines. Ficure 16.23 
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Theorem 5. If two parallel lines intersect a circle, then they intercept congruent arcs. 
(There are three cases to be considered: two secants, two tangents, one secant, and one 
tangent.) 

Theorem 6. In the same circle or in congruent circles, if two chords are congruent, 
then so also are the corresponding minor arcs. 

Theorem 7. In the same circle or in congruent circles, if two arcs are congruent, then 
so are the corresponding chords. 

Theorem 8. Given a circle, and an angle formed by a secant ray and a tangent ray 
with its vertex on the circle. Then the measure of the angle is half the measure of its 


intercepted arc. 
Theorem 9. No two different circles intersect in more than two points. 


16.5 THE TWO-CIRCLE THEOREM 


Let us now proceed to discuss tangent lines to a circle through an external 
point. The fact is that given a circle C and a point Q of its exterior, there are always 
exactly two lines which pass through Q and are tangent to C: 


A 


B 


Ficure 16.24 


The natural way to try to prove this is as follows. Let M be the midpoint of the 
segment PQ, where P is the center of C. Let C’ be the circle with center M and 


radius MP = MQ. 


Figure 16.25 


If C" intersects C in two points A and B, as the figure suggests, then QÀ and OB 
are tangent to C at A and B, respectively. The reason is that the angles ZPAQ 
and ZPBQ are inscribed in a semicircle and hence each is a right angle. We can 
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now apply Theorem 1, Section 16.2, which says that a line perpendicular to a 
radius at its outer end is tangent to the circle. In this reasoning there is a dangling 
if. To complete the above proof that there are two tangents through Q, we need 
to show that C and C’ intersect in two points. For this purpose we need the follow- 
ing theorem. 


Theorem 1. The Two-Circle Theorem. Let C and C’ be circles of radius a and b, 
and let c be the distance between their centers. If each of the numbers a, b, c 
is less than the sum of the other two, then C and C’ intersect in two points. 
And the two points of intersection lie on opposite sides of the line of centers. 


Before proceeding to prove the theorem, let us see how it applies to our problem 
in connection with the external tangents through Q. In Fig. 16.25, let the radius 
of C be a, and let the radius of C’ be b = PM. The distance between the centers 
isc = MP = b. Since Q is an external point, we have 


PQ » a, 
so that 
a « 2b. 


Then (1) a € b + c, because b + c = 2b, and a < 2b. Also (2) b <a+e, 
because b + c = 2b. Finally, (3) c < a + b, because c = b. Therefore the 
two-circle theorem applies, and so it follows that C and C" intersect in two points 
A and B. Therefore there are at least two tangents to C through Q. Later (in the 
next section) we shall show that there are exactly two tangents through Q. The 
rest of this section will be devoted to the proof of the two-circle theorem. 

If a, b, and c are the lengths of the sides of a triangle, then each of the numbers 
a, b, cis less than the sum of the other two. (We know this by three applications of 
the triangular inequality.) We shall prove the converse. 


Theorem 2. The Triangle Theorem. Given three positive numbers a, b, c. If 
each of these numbers is less than the sum of the other two, then there is a tri- 
angle whose sides have length a, b, c. 


Proof. Without loss of generality, let us suppose that 
azbzc. 


Take a segment BC, of length a. We want to find a point A such that AB = c 
and AC = b. We shall start by assuming that there is a triangle A ABC, of the 


AQ 


Figure 16.26 
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sort that we are looking for, and then find out where the point A must be. Of 
course this procedure will not prove anything in itself, because we are starting by 
assuming the very thing that we are supposed to be proving. But once we have 
found the exact location of the points that might work, it will be easy to check 
that they really do work. 

Suppose, then, that A ABC is given, with sides of the desired length, as indicated 
in Fig. 16.26. Let D be the foot of the perpendicular from A to BC. Then B-D-C, 
because BC is a longest side of AABC. Therefore, if BD = z, then DC = a — z. 
Let AD = y. Then by two applications of the Pythagorean theorem, we have 


P=? a, a) 
y? = b? — (a — 2)”. (2) 
Therefore 
c? — r? = b? — (a — 2)?, 
so that 
c? — r? = b? — a? + 2ar — z?, 
and 
2az = a? + c? — b’. 
Therefore 
a? ES ce IÍ b? 
ratte, (3) 
and from (1), we get 
y = vc? — a?. (4) 


What we have proved so far is that if x and y satisfy (1) and (2), then x and y 
satisfy (3) and (4). We shall check, conversely, that if z and y satisfy (3) and (4), 
then x and y satisfy (1) and (2). Half of this is trivial. If (4) holds, then so also does 
(1). Suppose then, that (3) is satisfied. Reversing the steps in the derivation, we 


get 
c) — z? = b? — (a — 2)”. 


Since (1) is known to hold, it follows that 
y! =v? — (a — 2), 
which is Eq. (2). 
We can summarize this by writing 
(1) and (2) €» (3) and (4). 


Now that we know what triangle to look for, let us start all over again. We have 
three positive numbers a, b, c, with 
agta 
Let 
2 2 2 
a° +c —b 
z= 2a , 
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then x > 0, because a? = b? and c? = 0. We want to set 
y = ve — gt, 


but first we have to prove that c > z, to make sure that the radicand is positive. 
Obviously it will be sufficient to show that c — x > 0. Now 


» cong’ aig Eaton bt 
2a 
| 2ac — a? — cV 
mi 2a 
_ b? — (a? — 2ac + c?) 
A 2a 
|. — (a— o? 
M RO om ri 


We now know that a < b +c. Therefore a — c < b. Since both a — c and b 
are 20, it follows that (a — c)? < b?; and this means that c — x > 0, ore > z. 


Fia at 16.27 


We are now ready to construct our triangle. Let BC be a segment of length a. 
Let D be a point of BC such that 


s. +c —P? 

BD -r-— Um (3) 

Let DE be a ray starting at D, perpendicular to BC, and let A be a point of DE 
such that 

AD =y = Vc? — mz (4/) 

Since x and y satisfy (3) and (4), it follows that x and y satisfy (1) and (2). Thus 

a + y? = o? (1’) 

(a — 2)? + y? = b?. (2^) 


But «x? + y? = AB?, and (a — z)? + y? = AC?. Therefore AB? = c? and 
AC? = b?. Since b and c are positive, this means that AB = c and AC = b. 
Therefore A A BC is a triangle of the sort that we were looking for. 
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On the basis of the triangle theorem, it is easy to prove the two-circle theorem. 
We have given a circle C, with center P and radius a, and a circle C’, with center 
M and radius b: 
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The distance PM between the centers is c, and each of the numbers a, b, c is less 
than the sum of the other two. Therefore there is a triangle ARST, with RS = a, 
ST = band RT — c: 


Fieure 16.29 


Let A be a point in the plane of our two circles such that ZAPM = ZR and 
AP = a= RS. By SAS, ARST = APAM, so that AM = ST = b. Thus 
A is on both C and C’. Let B be a point on the opposite side of PM from A, such 
that ZBPM z ZR and BP = a= RS. By SAS, ARST = APBM, so that 
B is on both C and C’. It is not hard to check that these points A and B are the 
only points where the two circles intersect. (This was Theorem 9 in Problem 
Set 16.4.) 

The two-circle theorem should have been stated as a postulate in Euclid's 
Elements, but the necessity of this was overlooked. 


cure | 
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17.1 THE MOST GENERAL CONCEPT OF CONGRUENCE 


By now we have given five different definitions of the word congruent, for five dif- 
ferent kinds of figures. Two segments are congruent if they have the same length. 
Two angles are congruent if they have the same measure. Two triangles are con- 
gruent if there is a one-to-one correspondence between their vertices, such that 
every pair of corresponding sides are congruent (that is, have the same length) 
and every pair of corresponding angles are congruent (that is, have the same meas- 
ure). Two circles are congruent if they have the same radius. Finally, two cir- 
cular arcs are congruent if (1) the circles in which the ares lie are congruent, and 
(2) the arcs have the same degree measure. All this is quite correct, logically speak- 
ing, but in a number of ways it leaves things to be desired. 

In the first place, it was promised at the outset that the intuitive meaning of 
the word congruent was always going to be the same: two figures were going to 
be called congruent if they had exactly the same size and shape, that is, if one could 
be moved so as to coincide with the other. This promise has, in a sense, been kept. 
It is not hard to convince yourself that all five of the technical definitions that we 
have just reviewed have this intuitive meaning. On the other hand, it is rather 
artificial to have five different definitions to convey the same idea in five different 
cases. It would be much better to have one definition which applies in the same 
way to segments, angles, triangles, and so on. 

In the second place, as a matter of common sense, most of us would agree that 
two squares with edges of the same length ought to be congruent: 


a a 
Ll i] 
a a 
Figure 17.1 


If the language of exact geometry doesn’t allow us to say so, then the language 
of exact geometry must, so far, be inadequate. 
Finally, it would be good to make some sort of contact with the old Euclidean 
idea of congruence. Euclid based all of his congruence proofs on a postulate that 
202 
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said that “things which coincide with one another are equal to one another.” 
(See the Common Notions, in Book I of the Elements.) This was not adequate to 
account for the things that Euclid actually did. Strictly speaking, figures coincide 
only with themselves. And it is plain that the idea of motion, or superposition, 
is implicit in Euclid’s congruence proofs. Some authors have attempted to make 
this idea explicit by stating a postulate to the effect that “geometric figures can 
be moved without changing their size or shape.” But this still is not enough; it 
clarifies the difficulty without removing it. The difficulty is that while the term 
figure is plain enough (a figure is a set of points) the terms moved, size, and shape 
have a very insecure status. They must be regarded as undefined terms, in the 
light of the plain fact that no definitions have been given for them. But if they are 
undefined, then postulates must be given, conveying their essential properties; and 
this has not been done either. Of course, the general drift of the postulate is plain 
enough. But you cannot base a mathematical proof on a general drift. 

It is possible, however, to formulate Euclid’s idea, and everybody’s intuitive 
idea, in an exact mathematical way. This we shall now proceed to do, by defining 
the general idea of rigid motion, or isometry. 

The simplest instance of this is as follows. Consider a rectangle []A BB’ A’. 
(The sides of the rectangle are dotted in the figure 
because we are really interested in the two bases 4 
only.) Let f be the vertical projection, 


E 

fa. — I l l 

f: AB € A'B', | 
| l | 

l l l 


P Cala 


| 
| 
l 
| 
l 
l 


of the upper base onto the lower base. Thus, for 1 
each point P of AB, f(P) is the foot of the perpen- A’ fp)-P' f(Q)y- Q' P 
dicular from P to A'B'. We know, of course, that 
f is a one-to-one correspondence between AB and 
A'B’. That is, to every point P of AB there cor- 
responds exactly one point P’ = f(P) of A'B’, and to each point P' of A'B' 
there corresponds exactly one point P = f^ (P^) of AB. And this correspondence 
f has a special property: if P and Q are any two points of AB, and P’ and Q’ are 
the corresponding points of A’B’, as in the figure, then 


PQ’ = PQ, 


because the segments PQ and PQ are opposite sides of a rectangle. Thus, for any 
two points P, Q of AB, the distance between f(P) and f(Q) is the same as the distance 
between P and Q. More briefly, the correspondence f preserves distances between 


points. Pe 5 
The correspondence f: AB + A’B’ is our first and simplest example of what is 


called a rigid motion, or an isometry. The general definition of this concept is as 
follows. 


Derinition. Let M and N be sets of points, and let 
fJ:M —N 


Figure 17.2 
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be a one-to-one correspondence between them. Suppose that for every two 
points P, Q of M we have 


JPK) = PQ. 


Then f is called a rigid motion, or an isometry, between M and N. [Here f(P)f(Q) 
denotes, of course, the distance between the points f(P) and f(Q).] If there is 
an isometry between M and N, then we say that M and N are isometric, and 
we write 

M =N. 


In this language, we can sum up our discussion of the vertical projection 
f: AB © A'B' in the form of the following theorem. 


Theorem 1. Opposite sides of a rectangle are isometric. 


PnonBLEM Ser 17.1 
1. Consider two triangles A ABC and A A'B'C', and suppose that 
AABC S AA'B'C'. 
Let 
V = {A,B,C} 
y' = (A', B', C^). 
(Thus V and V’ are finite sets of three elements each. Does it follow that V = V’? 
That is, is there a rigid motion 
f: Ve v 
2. Let V be the set of vertices of a square of edge 1, and let V^ be the set of vertices of 
another square of edge 1. Show that V ~ V’. (First you have to set up a one-to-one cor- 
respondence f: V <> V’, and then you have to show that f is an isometry.) 
3. Do the-same for the sets of vertices of the two parallelograms in the figure. 


4. Show that if V is a set of three collinear points, and V’ is a set of three noncollinear 
points, then V and V’ are not isometric. 


5. Show that two segments of different lengths are never isometric. 
6. Show that a line and an angle are never isometric. 
7. Show that every two rays are isometric. 


A B Ww Z 
D 3 c X 3 Y 
FicunE 17.3 


8. Show that two circles of different radius are never isometric. 


9. Let L and I’ be two lines in the same plane, and let f : L €? L’ be the vertical pro- 


jection of L onto L’. Show that (1) if L || Z’, then f is an isometry, and conversely (2) 
if f is an isometry, then L || L’. 
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10. Show that isometry is an equivalence relation. That is, 


M = M for every set M (Reflexivity). 
If M = N, then N ~ M (Symmetry). 
If Mı ~ M» and M2 = Ms, then Mı =~ M3 (Transitivity). 


17.2 ISOMETRIES BETWEEN TRIANGLES 


Theorem 1 of the preceding section was, of course, more special than it needed 
to be. More generally, we have the following Theorem. 


Theorem 1. If AB c CD, then there is an isometry f: AB + CD, such that 

f(A) = C and f(B) = D. 

Proof. First we need to define a correspondence f between the two segments, 
and then we need to show that f preserves distances. 

On AB let us set up a coordinate system, in such a way that the coordinate of 
A is 0 and the coordinate of B is positive. (It follows, of course, that the coordi- 
nate of B is the number AB.) 


A P Q B C P' Q' D 
[^] x y AB o x y AB 
FIGURE 17.4 Figure 17.5 


Similarly, we set up a coordinate system on CD in such a way that the coordinate 
of C is 0 and the coordinate of D is positive (and hence = CD = AB). The 
figures suggest how the correspondence f ought to be defined. Given a point P 
of AB, the corresponding point f(P) of CD is the point P’ which has the same co- 
ordinate as P. Obviously this is a one-to-one correspondence between the two 
segments. And distances are preserved. Proof: Suppose that P and Q are points 
of AB, with coordinates x and y. Then P' = f(P) and Q' = f(Q) have the same 
coordinates z and y, respectively. Since 


PQ = |z — yl, 
and 
P'Q' Pe lx oF. yl, 
it follows that 
PQ = PQ, 


which was to be proved. 
We can restate Theorem 1 in the following way. 


Theorem. Given a correspondence 
Aoc, BeD 


between the end points of two segments. If AB = CD, then there is an isometry 
f: AB «+ CD which agrees with the given correspondence at the end points. 
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Here f is called the isometry induced by the given correspondence. If we think 
of the theorem in these terms, then the extension to triangles is immediate. 


Theorem 2. Given a correspondence 
ABC e DEF 
between the vertices of two triangles. If 


AABC = ADEF, 
then there is an isometry 
f: AABC e ADEF, 


such that f(A) = D, f(B) = E, and f(C) = F. 


Proof. Let b bre: 
fı: AB e DE 

be the isometry induced by the correspondence A + D, B + E. Similarly, let 
fe : BC e EF 


be the isometry induced by the correspondence B €» E, C €» F; and let 
fa: AC e DF 


be the isometry induced by the correspondence A €» D, C +> F. Let f be the cor- 
respondence obtained by combining fı, f, and fs. That is, if P is on AB, then 
fJ(P) = fi(P); if P is on BC, then f(P) = fo(P), and so on. 

Since each f; is an isometry, f preserves the distance between any two points 
that lie on the same side of A ABC. Thus it remains only to show that f preserves 


B E 


A Q c D KQ) F 
Ficure 17.6 


the distance between any two points P, Q on different sides of AABC. Suppose, 


without loss of generality, that P is on AB and Q is on AC, as in the figure (Fig. 
17.6). Let P’ = f(P) and let Q’ = f(Q). Then 


because f; is an isometry; 
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because fz is an isometry; and 
ZPAQ & ZP'DQ', 
because AABC = Z DEF. By SAS, we have 


APAQ = AP'DQ', 
so that 
PR = P'Q', 
which was to be proved. 

The isometry f that was defined in the proof of Theorem 2 is called the isometry 
induced by the congruence AABC = ADEF. 

You may wonder, at this point, why we didn’t define congruence by means of 
isometry in the first place. The reason is that in the sort of geometry that we have 
been discussing so far in this book, the elementary definitions based on distance, 
angular measure and correspondences between the vertices of triangles are the defini- 
tions that are convenient to work with. Thus, if we had defined ZA 2 ZB to 
mean that ZA ~ ZB, the next thing that.we should have done is to prove that 
ZA ~ ZB if and only if mZ A = mZB, so that we could work with the latter 
statement instead of the former. Similarly, we would have shown that AABC = 
ADEF if and only if the triangles are congruent in the elementary sense, and 
this would enable us to talk about correspondences between triplets of points, in- 
stead of talking about correspondences between infinite sets of points. In general, 
basic definitions in mathematics should be stated in such a way that they can be 
put to work quickly and easily. 
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1. Suppose that the correspondence 
ABC € A'B'C' 


is an isometry. Show that if A-B-C, then A’-B’-C’. 

2. Given an isometry 

f[:M e N. 

Let A and B be points of M. Show that if M contains the segment between A and B, 
then N contains the segment between f(A) and f(B). 

3. Show that if M is convex, and M = N, then N is convex. 

4. Given M = N. Show that if M is a segment, then so is N. 

5. Given M = N. Show that if M is a ray, then so is N. 

6. Suppose that M is a segment and N is a circular arc. Then M and N are not iso- 
metric. 
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CARTESIAN COORDINATE SYSTEMS 


18.1 COORDINATES, EQUATIONS, AND GRAPHS 


Obviously, all readers of this book know about coordinate systems, from ele- 
mentary analytic geometry. For the sake of completeness, however, we explain 
them here from the beginning. To achieve speed and simplicity, and reduce 
the amount of outright repetition, we have introduced various novelties in the 
derivations. 

In a plane E we set up a Cartesian coordinate system in the following way. 
First we choose a line X, with a coordinate system as given by the ruler postulate. 
The zero point of X will be called the origin. We now take a line Y, perpendicular 
to X at the origin, with a coordinate system in which the origin has coordinate = 0. 


P 
N 
y Gps 
2 [ 
" l 
| 
} | 
| 
IM 
I——3À + eop yy 
EEA VES j^ APER 
-1 
-2 
Figure 18.1 


Given a point P of E, we drop a perpendicular to a point M of X. The coordinate 
xof M on X is called the z-coordinate, or the abscissa, of P. We drop a perpendicular 
from P to a point N of Y. The coordinate of N on Y is called the y-coordinate, or 
the ordinate, of P. Thus to every point P of E there corresponds an ordered pair 
(x, y) of real numbers, that is, an element of the product set R X R. Clearly this 
is a one-to-one correspondence 

EoRXR. 


For short, we shall speak of “the point (x, y),” meaning, of course, the point cor- 
responding to (z, y) in the coordinate system under discussion. 
208 
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Figure 18.2 


Theorem 1. The distance between the points Py = (zı, yi) and P2 = (t2, y») 
is given by the formula 


P,P. = V(@2 — %1)? + (ys — v)*. 


Proof. Let My, N1, Mo, Na be the projections of P; and P» onto the axes, as 
in the definition of coordinates. If xı = 2, then 


P;P2 || NiNs — dz — zı] = 0, 


and 
PPa = |y2— vil 


zs)" 
= s — 2)? + (Y2 — y)*. 


(Here we are ignoring the trivial case where P; = Ni and Pz = No.) If yı = ye, 
the same conclusion follows in a similar way. Suppose, then, that xı # x2 and 
yi * ys, as in the figure. Then the horizontal line through P, intersects the ver- 
tical line through P, in a point Q, and AP,P2Q has a right angle at Q. (Here, and 
hereafter, a horizontal line is X or a line parallel to X; and a vertical line is Y or 
a line parallel to Y.) Thus 


PQ = MM», 


and 
P2Q = NW, 


either because the point pairs are the same or because opposite sides of a rectangle 
are congruent. By the Pythagorean theorem, 


P,P} = PQ? + P2Q?. 
Therefore 
P,P} = M,M3 + N\N3 


= za — til? + lys — vil? 
=. (z2 — 1,)? + (y2 — i», 


and from this the distance formula follows. 
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By a linear equation in x and y we mean an equation of the form 
Ax+ By+C=0, 


where A, B, and C are real numbers, and A and B are not both = 0. By the graph 
of an equation, we mean the set of all points that satisfy the equation. More gen- 
erally, by the graph of a condition we mean the set of all points that satisfy the 
given condition. Thus the interior of a circle with center Q and radius r is the 
graph of the condition PQ < r; and one of our theorems tells us that the perpen- 
dicular bisector of a segment AB is the graph of the condition PA = PB. 


Theorem 2. Every line in E is the graph of a linear equation in z and y. 


Proof. Let L be a line in E. Then L is the perpendicular bisector of some seg- 
ment P;Ps, where P, = (x1, y1) and Pa = (x2, y2). Thus L is the graph of 
the condition 


PP; = PP2. 


Figure 18.3 


With P = ($y) this can be written algebraically in the form 
Ve = 5)! 4 ( — V) = V@— z9* F (y Ya), 


or 


a? — 2ziz + zi y! — Qyy + yi = z? — 2ra + r3 + y? — 2yay + y? 
or 


2(a2 — zi)z + 2(ya — yay + @2+ 43+ zi + yi) = 0. 
This has the form 
Az + By+C=0. 
And A and B cannot both be = 0, because then we would have x2 = 2; and 
Y2 = Yı; this is impossible, because P, # Po. 
Theorem 3. If L is not vertical, then L is the graph of an equation of the form 


y = mz + k. 
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Proof. Lis the graph of an equation 
Az + By +C — 0. 


Here B = 0, because for B = 0 the equation takes the form z = —C/A; and 
the graph is then vertical. Therefore we can divide by B, getting the equivalent 
equation 


This has the desired form, with 


H 


Theorem 4. If Lis the graph of y = mz + b, and (zi, y1), (£2, y») are any two 
points of L, then 


Proof. Since both points are on the line, we have 


y; = mra +k, yı = mz +k. 
Therefore 


ya — y1 = m(z2 — Tı), 


and x2 Æ 2, because L is not vertical. Therefore 


Thus the number m is uniquely determined by the line. It is called the slope of 
the line. 


Theorem 5. Let L and L’ be two nonvertical lines, with slopes m and m’. If L 
and L/ are perpendicular, then = 
1 


m = ——: 
m 


Proof. Let 
Pı = (xy) and Ps = (t2 y2) 


be points of L’, such that L is the perpendicular bisector of PP. (See Fig. 18.3.) 
As in the proof of Theorem 2, L is the graph of the equation 


(ay — 2) + Uys — yay + G3 + ya + at + yi) = 0. 


This has the form 
Az + By+C = 0, 
where 
A=2At2—1), B= 2y2—y)- 
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Therefore 
mic EEI T Sr ere 
B 2(y2 — y1) ya — Yı 
But, by Theorem 4, we have 
m= pA piled A 
tz — 21 
Therefore m^ = — (1/m), which was to be proved. 


Theorem 6. Every circle is the graph of an equation of the form 
z? +y? + Az + By +C = 0. 


Proof. By the distance formula, the circle with center (a, b) and radius r is thẹ 
graph of the equation 


VeE— at y— =r, 


z? — 2az + a? + y? — 2by + b? — r? = 0. 


or 


This has the required form, with 
A = —28, 


B= —2b, 
C= a? + b? — r. 
The converse of Theorem 6 is false, of course. The graph of 
z? +y -0 
is a point, and the graph of 


e+y+1=0 
is the empty set. 
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In proving the following theorems, try to use as little geometry as possible, putting 
the main burden on the algebra and on the theorems of this section. 

1. Show that the graph of an equation of the form 

z? +y + Az-c By+C =0 

is always a circle, a point, or the empty set. 

2. Show that if the graphs of the equations 

y =mr+k, y = moz ke 

are two (different) intersecting lines, then m1 # ma. 


3. Show that if mı = m», then the graphs are either parallel or identical. 
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4, In the chapter on similarity, we defined 
A1, By, C1 ~ A2, B2, C2 


to mean that all the numbers in question were positive and that 


Let us generalize this in the following way. Given Aj, Bi, C1, not all = 0. If there is a 
k # 0 such that 
Az = kA, Bo = kBi, C2 = kCi, 


then we say that the sequences A1, By, C1 and A2, B», C are proportional, and we write 
Ay, By, Ci ~ Ag, B2, C2. 
With this understanding, show that if 
Aww + Byt C17 0 and Aor + Bay 4- C2 = 0 
have the same line L as their graph, then 
A1, Bi, C1 ~ Ao, B2, Co. 
[Hint: Discuss first the case where L is vertical, and then the case where Lis not vertical.] 
5. Describe the graphs of the following equations. 
(a) 22--y?--1--2z-F2y--2zy =0 (9 w= 0 (c) +ay?—2x=0 


Ev 


CONSTRUCTIONS WITH RULER AND COMPASS 


19.1 UNMARKED RULERS AND COLLAPSIBLE COMPASSES 


In the introduction to the foundations of geometry, given in the first few chap- 
ters of this book, we used postulates which fall into six groups: (1) the incidence 
postulates, (2) the separation postulates, (3) the ruler postulate, (4) the “protractor 
postulates," (5) the SAS postulate, and finally (6) the parallel postulate. (Here, of 
course, by the "protractor postulates" we mean the ones dealing with the measures 
of angles.) Our treatment of incidence and separation has been quite standard; 
and so also was one treatment of parallelism. In fact, where these topies are con- 
cerned, the differences between one book and another are mainly in the style of 
exposition and the degree of explicitness. (Some elementary books consider separa- 
tion properties to be unworthy of mention.) 

The ruler and protractor postulates, however, are another matter. These were 
invented rather recently (by G. D. Birkhoff) and the mathematical spirit reflected 
by them is quite different from the mathematical spirit of the Greeks. The basic 
difference, roughly speaking, is that rulers and protractors are used for measuring 
things; we place them on geometric figures and we read off real numbers from the 
scales that are marked on them. Birkhoff’s metric postulates tell us, in effect, that 
we have an “ideal ruler” and an “ideal protractor,” with which we can measure 
segments and angles exactly. The Greeks, on the other hand, considered that meas- 
urement was merely one of the practical arts. It was not considered to deserve the 
attention of mathematicians and philosophers. Just as we have described the 
metric treatment in terms of two drawing instruments, the marked ruler and pro- 
tractor, so we can describe Greek geometry in terms of two different drawing in- 
struments—the unmarked ruler and the compass. The Greeks thought about 
geometry in terms of these two instruments; and they investigated at length the 
question of what figures could be constructed by means of them. 

Before we proceed to consider problems of construction with ruler and compass, 
several warnings are in order. 

(1) When we speak of a ruler and a compass, we mean an “ideal ruler” and an 
"ideal compass,” which draw straight lines and circles exactly. The thickness of 
pencil marks and the approximations involved in draftsmanship will not concern 
us. 
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(2) The Euclidean ruler has no marks on it. We can use it to draw the line 
through two given points, but that is all we can. use it for. We cannot use it to 
measure distances between points, or even to tell whether two segments are 
congruent. 

(3) The Euclidean compass can be used in the following way. Given a point P 
and a point Q (in the plane), we can draw the circle that has center at P and con- 
tains Q. This is all that we can use the Euclidean compass for. That is, given a 
third point P’, we are not allowed to move the spike of the compass to P' and then 
draw the circle with center at P’ and radius PQ. For this reason, the Euclidean 
compass is called collapsible; you can’t move the spike because “when you lift 
the spike off the paper the compass collapses.” Another way of putting it is that 
you can’t use the compass as a pair of dividers. Oddly enough, modern draftsmen 
feel the same way that Euclid did on this rather delicate question. 

(4) In studying ruler and compass constructions, we shall not attempt to build 
the foundations of geometry all over again. In proving that our constructions 
work, we shall make free use of the theorems of metric plane geometry. In particu- 
lar, we shall make continual use of the two-circle theorem. 

(5) Ina construction problem, when we say that a line is “given, " we mean that 
at least two points of the line are given. 

Let us now try a few constructions. 

Construction 1. To construct the perpendicular bisector of a given segment. 

Given two points P, Q. First draw the circle C with center at P, containing Q; 
and then draw the circle with center at Q, containing P: 


Ficure 19.1 


Let PQ — a. Sincea « 2a, it follows that each of the numbers a, a, a is less than 
the sum of the other two. Therefore the hypothesis of the two-circle theorem is 
satisfied. (Most of the time, our verifications of the hypothesis of the two-circle 
theorem will be as trivial as this one.) Therefore C; | and Ce intersect in two points 
R and S, lying on opposite sides of PQ. Therefore RS intersects PQ in a point T. 

Since T lies on RS, and RS is a chord of both circles, it follows that T is in the 
interior of both circles, and both TP and TQ are less than PQ. Therefore T-P-Q 
and P-Q-T' are impossible. Therefore P-T-Q. 
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Now ASPR = ASQR, by SSS. Hence Z1 = Z2. By SAS we have 
APRT = AQRT. Therefore Z5 = Z6, so that RS 1 PQ, and PT & TQ. 
Thus RS is the perpendicular bisector of PQ. This gives us a sort of “corollary 
construction”: 

Construction 2. To bisect a given segment. 

First construct the perpendicular bisector, as above. The point T' is the bisector 


Construction 3. To construct the perpendicular to a given line, through a given 
point on the line. 


Given a line L, and a point X of L. Since L was given, at least one other point 
P of L must be given. Draw the circle C which has center at X and contains P. 


Then C will intersect L in exactly one other point Q. Now construct the perpendicu- 
lar bisector of PQ; this will be perpendicular to L at X. 


P [^ 
Y L 
Figure 19.2 
Construction 4. _Given three points P, Q, R. To construct a rectangle L]PQST, 
such that PT = PR. 
R 
Q 
P \ 
\ 
1 \ 
\ \ 
| L 
loueur $ 
T 
FiavnE 19.3 


First construct the perpendicular L; to PQ at P. Then draw the circle C that 
has center at P and contains R. The circle C will intersect L; in points 7 and T". 

Now construct Lo perpendicular to PQ at Q ; and construct L3 perpendicular to 
L, at T. If L and Lz were parallel, then P7? and PÒ would be parallel, which is 
false. Therefore L; intersects Lg in a point S. We know that each pair of opposite 


sides of []PQST are parallel; and three of its angles are right angles. Therefore 
CIPQST is a rectangle. 
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Figure 19.4 


Construction 5. Given a segment PQ and a ray AB. To construct a point C of 
AB such that AC = PQ. 


Ge 
o 
w 


Fioure 19.5 


First we construct a rectangle OPATU, with PU = PQ. Then AT ~ PQ. 
Draw the circle with center at A, containing T. This will intersect AB in a point 


C; and we will have AC = PQ, as desired. 
Note that once we have Construction 5, we are free to forget about the collapsi- 


bility property of our Euclidean compass. The point is that the ruler and compass, 
in combination, furnish us in effect with a pair of dividers. 
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1. Find a simpler method of doing Construction 5, which works whenever PQ < PA. 
(Hint: First construct an equilateral triangle AP AU.] 

2. Show that if Construction 5 can be done in the case where PQ < PA, then it can 
be done in the general case. 

3. Construction 6. Given AABC, and WB = AB. To construct a point C' on a given 
side of AB, such that AABC = AA'B'C'. 

4. Construction 7. Given Z ABC, PQ, and a side H of TQ. To construct à ray PR, 
with R in H, such that ZABC = ZRPQ. 


218 CONSTRUCTIONS WITH RULER AND COMPASS 


5. Construction 8. Givena line L and point P. To construct the line through P parallel 
to L. 

6. Construction 9. Given AB and a positive integer n. To divide AB into n congruent 
segments. 

7. Suppose that your Euclidean ruler is not a “theoretical ruler,” of infinite extent both 
ways, but a ruler of finite length, say, one inch. Suppose also that the points of your 
compass cannot be spread apart more than an inch. Show that given any two points—no . 
matter how far apart they may be—you can draw the segment between them. 

8. In carrying out Construction 9, you probably found it convenient to draw, at ran- 


dom, a ray AP which was not collinear with AB. Show that this process of random choice 
can be avoided. 


Strictly speaking, random choices of points are not allowed in doing construction prob- 
lems. The reason is rather curious: if they were allowed, then the so-called “impossible 
construction problems” to be discussed later in this chapter would be not quite impossible. 
For example, an infinitely lucky person might manage to pick, at random, a point on a 
trisector of any given angle. 


19.2 HOW TO DO ALGEBRA WITH RULER AND COMPASS 


Suppose that we have given a segment of length 1, and two segments of length 


aand b: 
1 a (5 oe 


Ficure 19.6 


We shall show that all of the elementary operations of algebra for the numbers 
a and b can be carried out with ruler and compass. That is, with ruler and compass 
we can construct segments whose lengths are 


1 


a+b, z ab, T Va. 


a 
(1) The first of these constructions is trivial. On any line L, we lay off a segment 


PQ of length a, and then lay off a segment QF of length b, in such a way that 
P-Q-R. The others require tricks. 


(2) In Fig. 19.7, we have Z1 & Z2, so that AABC ~ AADE. Therefore 


AC _ AE 
AB AD 
If AB = a, AC = 1, and AD = 1, this says that 
1 AE 
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Ficure 19.7 


This indicates a way to construct a segment of length 1/a. We start with any 
ange ZQAR. On AQ we lay off AB with AB = a, and AD with AD = 1. 
On AR we lay off AC, with AC = 1: 


Fieure 19.8 


We now construct a ray DS so that ZADS = Z1. (Any angle can be copied 
with ruler and compass.) You ought to be able to prove, with no trouble, that 


DŠ intersects AR in a point E. The segment AB is the segment that we wanted. 
(3) In the figure below, we have Z1£ Z2, so that APXY ~ APZW. 


"Therefore 


so that PZ = zy. The construction of the figure is exactly as in the preceding case. 
(4) To construct a segment of length b/a, we first find 1/a, and then multiply 
the result by b. 
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(5) Finally, we want a segment of length v/a. First we construct segments PQ, 
S 


f; 


P-AA-Q M a R 


Figure 19.10 


QR, so that P-Q-R, PQ = 1 and QR = a. We bisect the segment. Let M be its 
bisector. With center M and radius 


MP = MR - 15, 


we draw a circle. Next we construct a perpendicular to PR at Q. This line inter- 
sects the circle in two points, one of which is the S shown in the figure. Let z — QS. 
Since APQS ~ ASQR, we have 


QS QR 

PR SQ 
or 

z_¢ 

Te 
or 

z? =a. 


Therefore z = v'a, and QS is the segment that we were looking for. 


19.3 SOLVING EQUATIONS WITH RULER AND COMPASS 


We have found that with ruler and compass we can add, multiply, divide and 
extract square roots, starting with positive numbers. Let us now suppose that we 
have a coordinate system in the plane: 


FicunE 19.11 
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We want to know which points we can plot with ruler and compass, given the 
points with coordinates 1, a, and b on the z-axis. Of course, negative numbers 
have now entered the picture: a and b may easily be negative. But this causes no 
trouble; if we can plot the point with coordinate z, then surely we can plot the 
point with coordinate —z; given z and y, we can plotz—y, y —2, (—2)y, 
z(—y), and so on. 

This means that with ruler and compass we can perform all the operations described 
in the postulates for a Euclidean ordered field. 'That is, we can add, subtract, multiply, 
divide, and extract square roots, in all cases where these operations are algebrai- 
cally possible. Hereafter, when we speak of “plotting a number,” we shall mean, 
of course, plotting the corresponding point on the z-axis. Obviously, if we can 
plot both h and k, then we can plot the point (h, k) in the coordinate plane. We 
merely construct perpendiculars, as in the figure, thus getting their intersection. 
And conversely, if P = (h, k) is given, we can plot h and k by dropping perpen- 
diculars to the axes. 

Y 


(h, k) 
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For this reason, in many cases we can solve algebraic problems by going through 
ruler-and-compass constructions. This process is not merely a stunt. We shall 
use it to solve some construction problems which would otherwise be very difficult 
indeed. 

Problem 1. Given the points with coordinates a, b, c on the z-axis, with b? — 
'4ac > 0. We wish to plot, with ruler and compass, the roots of the equation 


ax? + bx +c = 0. 
These roots are the numbers 


—b + Vb? — 4ac d _ —b — Vb? — 4a, 
ECHOS VE. an ni rr TESSUTI APIS 


2a 


Each of them can be computed, starting from a, b, and c, by a finite number of 
. additions, subtractions, multiplications, divisions and root-extractions. Each of 
these operations can be performed geometrically. Therefore the roots can be 
plotted. 
Problem 2. Given the points on the z-axis with coordinates A, B, C, A’, Bi Gt 
We wish to plot the numbers 23, 1 which are the solution of the system 


Az + By +C = 0, (1) 
A'z + By +C'= 0. (2) 


$47 
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We are interested in the case where the graphs of the equations are nonparallel 
lines intersecting in a single point (x1, y1). This occurs when 


AB' — BA’ x 0. 
Y 


X 


» 


Q) 
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In this case, by the usual elementary methods, we get the solution in the form 


xit — BU 
t Ab pa" 
A'C — AC’ 


n= ABC Ba 


All the operations required here can be done with ruler and compass. Therefore zı 
and y; can be plotted. In fact, if you are actually going to plot x, and y;, there is 
a much shorter method. We claim that if A, B, and C are plotted, then at least 
two points of the line 


Az + By+C=0 


can be plotted. If B = 0, we can set x = 0 and z = 1, getting the points 
(0, — (C/B)), (1, (CC —A)/B). If B = 0, then the corresponding line is vertical, 
and we can plot the points ((—C/A), 0), ((—C/A), 1). Once we have plotted these 
points, we can draw the line that contains them: 


Fieure 19.14 
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Thus, if the coefficients in the equation of a line can be plotted, the line itself 
can be drawn by a rule-and-compass construction. The short way to construct 
the solution of a pair of linear equations, therefore, is to draw the corresponding 
lines and see where they intersect. The same idea leads to an even greater economy 
in more difficult cases, as we shall see. 


Problem 3. Suppose that a, b, r, A, B, and C are plotted. We want to plot the 
common solutions (x1, y1), (%2, Y2) of the equations 


(z— a)? + (y — 0)? = r’, (1) 

At + By +C = 0, (2) 

for the case in which such common solutions exist. We assume, as usual, that A 
and B are not both 0, so that the graph of (2) is a line. 

This one is easy. First we draw the graph of (2) by the method used in the pre- 

ceding problem. The graph of (1) is a circle with center at (a, b) and radius r. 


Since a, b, and r are given as plotted, we can draw this circle. We have now plotted 
whatever intersection points may happen io exist. 


x 


Figure 19.15 


Problem 4. Suppose that A, B, C, D, E, and F are given as plotted. Suppose 
that the graphs of 
z? +y? + Az 4- By - C — 0, (1) 


Dz + Ey +F = 0, (2) 
are a line and a circle, respectively. We want to plot the common solutions (21, Y1), 


(£2, yo), in the cases where such solutions exist. 

The first step, of course, is to draw the graph of (2). The remaining problem is 
to show that the graph of (1) can also be drawn. 

Completing the square in the usual way, we convert (1) to the form 


2 B? A? B? 
| tdrt + tB CTT 


2 a A?4+B?—4C 
(+3) +(v+8) -444-—*. 


or 
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Thus the center of our circle is the point 


(a, b) = (-4.-5. 


r = VA? + B? — 4C. 


All three numbers a, b, and r can be plotted, since A, B, and C can be plotted. 
Therefore the circle can be drawn exactly as in the preceding problem. 


and its radius is 


Problem 5. Given a system of equations 


z? +y? + Ar+ By - € — 0, 
x? +y? + Dr + Ey +F = 0, 


where the coefficients are given as plotted. Again, we want to plot the common 
solutions in the cases where there are any. To do this, we draw the two circles by 
the method used in the preceding problems. 

We kave now gotten involved in a lengthy investigation of relations between 
geometry and algebra. This may seem foreign to the spirit of geometry, but in 
fact it is foreign merely to the spirit of Greek geometry. We shall see that once 
algebra has been introduced, we can get easy solutions of problems that the Greeks | 
found difficult, and we can get difficult solutions of problems which the Greeks 
found impossible. y 


19.4 THE PROBLEM OF APOLLONIUS 
Given three circles C, Co, and C; in the plane. The problem of Apollonius is 


to construct, with ruler and compass, all possible circles C which are tangent to 
all three of the circles C1, C2, and C3. One such C is shown in the figure. 


Bi A 
bi eo 


FiauRE 19.16 


Using the methods of the preceding section, we shall find without much trouble 
that all possible circles C are constructible with ruler and compass. 
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[9 


(1) 
(2) 


68) 
Figure 19.17 


y Given Cı, with center P, and radius r1, and C, with center P and radius r, it 
is rather easy to find the conditions under which the two circles are tangent. If 
the circles are tangent and mutually exterior, then 


PPy =r tT. (1) 
If they are tangent, and C; lies inside C, then 

PP tiai ti (2) 
Finally, if they are tangent, and C lies inside C, then 

PP Se T: (3) 


Conversely, each of these equations implies that the corresponding geometric 
condition holds. We can sum all this up by saying that C and C, are tangent if and 


only if 
PP? = (r +11)”. 


To connect this up with our algebra, we let the centers be the points (a, b), 
(ai, by). Our equations then take the form 


(a — a)’ + (b — bi)? = (rx n). (4) 
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We can now restate the problem of Apollonius. C is tangent to all three of the 
circles C, C2, C3 if all three of the equations 


(a — ay)? + (b — by)? = (r + r)? (4) 
(a — a3)? + (b — ba)? = (r + 12)? (5) 
(a — a3)? + (b — bs)? = (r + r3)? (6) 


hold for some choice of + or — in each equation. 

Here, for each of the eight possible choices of signs, we have a system of three 
equations in the three unknowns a, b, and r. Our problem is to solve these equa- 
tions with ruler and compass, in the cases where solutions exist. 

First we multiply out and collect terms, getting 

a? + b? + r? — 2a1a — 2bjb Arar = r1 — a? — 0. 
This has the form 
a? +b? — r? + Aja + Byb+Cir+ D, = 0, 0) 
where the coefficients are numbers that can be plotted. Similarly we get 


a? + b? — r? + Asa + Bob + Cor + Ds = 0, (8) 
a? + b? — r? + Asa + Bab + Car + Ds = 0, (9) 


where all of the coefficients can be plotted. Subtracting (7), term by term, from 
(8) and (9), we get two equations of the form 


Esa + Fb + Gor + Ha = 0, (8’) 
Esa + Fab + Gar + H; = 0, (9^) 


where all of the coefficients can be plotted, being differences of plottable numbers. 

We now propose to solve for a and b in terms of r. That is, we are going to re- 
gard Gor + Ho and Gar + H3 as constant terms. By the methods referred to in 
problem 2 of the preceding section, we get 


— Fs(Gar + H3) — Fa(Gor + Hə), 


k EF, — Fabs 
p = Fs(Gor + H2) — Fo(Gsr + Ha), 
EF} E FE; 


Here a and b have the forms 
a=Jr+K,,  b-—Jsr-Ks; 


where all the coefficients can be plotted, because the previous coefficients were. 
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We now substitute in (7) these expressions for a and b. Equation (7) now be- 
comes a quadratic equation in r alone, with coefficients all of which can be plotted: 


(Jar + Ky)? + (Jor + Ko)? — r°? + Aq(Jar + Ki) 
+ B,(Jor + Ke) + Cir + D, om 0; 


We can solve such an equation with ruler and compass to plot the point r. We then 
plot a and b. This solves our problem. 

Note that if any of these steps are algebraically impossible, this means that our 
geometrical problem was impossible in the first place. This can easily happen. For 
example, if our three given circles are concentric, then no circle is tangent to all 
of them. 

Various modifications of Apollonius’ problem can be solved by exactly the same 
method. Suppose, for example, that we want to construct all circles C which pass 
through a given point P, and are tangent to two given circles C2, and C3. To solve 


C; c 


P, 


Ficure 19.18 


the problem, we would simply set rı = 0, and then proceed exactly as before; 
similarly, if two points and a circle are given. You can even use the method to 
pass a cirele through three given points, if you want to, but you surely don't want 
to. (Why?) 

It should be understood that the sort of analysis that we have been going through 
does not lead to valid methods in mechanical drawing. In all really complicated 
construetion procedures, there are so many steps that the cumulative error is 
likely to make the final result quite unrecognizable as a “right answer." To keep 
this cumulative error small, we need to do the algebra algebraically. 


19.5 THE IMPOSSIBLE CONSTRUCTION PROBLEMS OF ANTIQUITY 


The sort of methods that we have been using were of course quite unknown to 
the Greeks. Some of the Greek mathematicians (notably Archimedes) were as 
good as any mathematicians who have ever lived. But some easily stated prob- 
lems defeated them completely; and many centuries later the reasons for this 
became clear. It turned out that geometry, in the sense in which the Greeks 
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understood it, is not a self-contained subject and that some of its elementary 
problems require, for their solution, branches of mathematics that the Greeks did 
not manage to discover. 

Probably the most famous of these elementary problems are the trisection of 
the angle and the duplication of the cube. Given an angle, we are asked to construct 
its trisectors with ruler and compass. 


Fieure 19.19 


Given a segment of length a, it is required to construct a segment of length b, 
such that 
ors: 2a 


Ifa = 1, thenb = 4/2. Thus our problem is to plot ~/2 on the x-axis. The Greeks, 
of course, did not put it this way. In purely synthetic terms, given a segment AB, 
we are to construct a segment CD, such that a cube with CD as an edge has twice 
the volume of a cube with AB as an edge. 

It turned out that both these problems are impossible: no such constructions 
exist. The rest of this chapter will be devoted to the proofs of these statements. 
In the following sections, it will seem that we are going rather far afield; we shall, 
because we have to. It was the need for going far afield that made the impossi- 
bility proofs so hard to discover. 


19.6 THE SURD FIELD 


A number z is called a surd if we can calculate x by a finite number of additions, 
subtractions, multiplications, divisions, and extractions of square roots, starting 
with 0 and 1. 

For example, 2 is a surd, because 2 = 1 + 1, and 1 is a surd ex officio. Given 
that n is a surd, it follows by addition that n + 1 is a surd, because the sum of 
two surds is a surd. By induction we conclude that every positive integer is a surd. 
By subtraction, every negative integer is also a surd. By division, we have the 
following: 


Theorem 1. Every rational number is a surd. 
And it is easy to show that: 


Theorem 2. The surds form a Euclidean ordered field. 


To check this, we note first that the associative, commutative, and distributive 
laws automatically hold for surds, since they hold for all real numbers. The same 


| 
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observation applies to the postulates for order. Since we are allowed to form 
surds from other surds by addition, subtraction, multiplication, and division, it 
follows that the set of surds contains sums, products, negatives, and reciprocals of 
allits elements. (Except, of course, that 0 has no reciprocal.) Therefore, the surds 
form an ordered field. Finally, since we are allowed to form surds by extracting 
square roots of positive surds, it follows that the surd field satisfies the Euclidean 
completeness postulate. 

We denote the surd field by S. 

Later we shall see that some numbers are not surds. Granted that there is one 
nonsurd, it follows that there are lots of others. For example, if z is not a surd, and 
p and q are nonzero integers, then y = (p/q)z is not a surd. The reason is that if 
y were a surd, then x would be the product of the surds y and q/p. Therefore x 
would be a surd after all. In general, if a is a surd and z is not, then az is not a 
surd. 

We return now to the coordinate plane. By an S-point (S for surd), we mean & 
point both of whose coordinates are in S. By an S-line, we mean a line which con- 
tains at least two S-points. By an S-circle, we mean a circle whose center is an 
S-point and whose radius is in S. By an S-equation we mean an equation of the 
form 

Axr + By+C=0 
or 
£? +y? + Dr 4 Ey +F = 0, 


in which all of the coefficients are in S. These ideas are connected up by the follow- 
ing theorems. 


Theorem 3. Every S-line is the graph of an S-equation. 


Proof. Let L be a line containing the S-points (zi, Y1), (zs; y). If L is vertical, 
then L is the graph of the S-equation 


g= 2, =—'0. 


If L is not vertical, the slope of L is 


Here m is a surd, because the surds form a field; and L is the graph of the equation 


y — yı = mx — 2), 
or 
mz — y — (—yı + mzi) = 0, 


which is an S-equation. The converse is also true. 
Theorem 4. If a line L is the graph of an S-equation, then L is an S-line. 
Proof. Suppose that L is the graph of the S-equation 
Az + By -C — Q0. 
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If B # 0, then we can set z = 0 and z = 1, getting the S-points (0, — (C/B)), 
(1, —(C + A)/B). If B= 0, then L contains the S-points (—(C/A), 0) and 
(—(C/A), 1). 


Theorem 5. Every S-circle is the graph of an S-equation. 
Proof. Let C be an S-circle with center (a, b) and radius r. Then C is the graph of 


(z— a)? + (y — bP? =r, 
or 
z? + y! — 2az — 2by +a? + b? — r? = 0, 


which is an S-equation. 


Theorem 6. If a circle C is the graph of an S-equation, then C is an S-circle. 


Proof. Given that 
x? +y? + Dr +Ey+F=0 


is an S-equation, and that its graph is a circle. We can then convert the equation 


to the form 
Dy EN? = D? + E? — 4F 
Cy qr ege 
"Therefore the center is the S-point (—(D/2), —(E/2)) and the radius is the surd 
r = 4D? + E? — AF. 


This sort of elementary algebra is easier to write than to read. We therefore 
leave to you the verification of the following theorem. 

Theorem 7. Let P be a point in the intersection of (1) two S-circles, (2) two S- 

lines, or (3) an S-circle and an S-line. Then P is an S-point. 


The reason is roughly as follows. The coordinates z1, yı of P are common solu- 
tions of two S-equations, each of which has the form 


Ac+By+C=0 o az?- y?-- Dz - Ey - F — 0. 


When we solve such a system by the usual algebraic methods, we find ourselves 
calculating x, and y; by starting with the coefficients and then performing the 
operations +, -, —, +, v. These operations keep us within the surd field at every 
stage. Therefore the final results x; and y; must be surds. 


19.7 THE SURD PLANE 


Let E be a coordinate plane. Let E be the set of all surd points in E. The set E 
will be called the surd plane. For each S-line L, let 


L-EÉnL. 
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The sets L are called surd lines. Similarly, if C is an S-circle, let 
C=Enc. 


The sets C are called surd circles. 
As we have rem.:rked before, it will turn out that not all real numbers are surds. 
Therefore E is full of holes. In fact, E does not contain all of any line or any circle. 


Y 


» 


Ix 


| 
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In fact, if xı and y; are nonsurds, then the lines z = 21 and y = y; contain no 
points of E at all. These lines cut the plane (in a manner of speaking) into pieces 
only one point wide. 

On the other hand, if you were to investigate the surd plane by experimenting 
with ruler and compass, you could never tell that any points were missing. The 
only lines and circles you could draw would be S-lines and S-circles. If two S- 
lines, Lı and Lg, intersect in a point, this point is in E. (See Theorem 7, Section 
19.6.) Therefore L; intersects Ls. If an S-line L intersects an S-circle C the inter- 
section points are in E (same theorem). Therefore 

LnO-LncC. 
If two S-circles Cı, C2 intersect, then so also do the surd circles Ci, Cs, in the 
same points. 

Thus, although surd lines and surd circles are “full of holes,” they “never pass 
through each other where the holes are”; they intersect each other everywhere we 
expect them to, We sum this up in the following theorem. 


Theorem 7-1. Every ruler-and-compass construction which is possible in the 

plane is also possible in the surd plane. 

In the light of this theorem, the questions of trisecting angles and duplicating 
cubes become questions of fact, as follows. 

(1) Is it true that, in the surd plane, every angle has a trisector? 

(2) Is V2 a surd? 

If the answer to (1) is “No,” then there is no general method of trisecting angles 
with ruler and compass; the only points that we can construct are the surd points, 
and therefore we cannot construct a line that contains fewer than two surd points. 

Similarly, if 4/2 is not a surd, we cannot construct a segment whose length is 
V2. 

To answer questions (1) and (2), we need to do some algebra. 
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19.8 QUADRATIC EXTENSIONS OF FIELDS. CONJUGATES IN A QUADRATIC 
EXTENSION FIELD 


Let F be a subfield of the real number system. Let k be a positive number be- 
longing to F, and suppose that 4/k does not belong to F. Let 


F(k) = {x + yv klz, y € F}. 


The set F(k) is called a quadratic extension of F. $ 
For example, if F is the field Q of rational numbers, then 2 € F and v2 g F. 
Therefore we can form the quadratic extension 


F(k) = Q(2) = { + yv2lz,y € Q}. 


We found, at the end of Problem Set 1.7, that these numbers form a field; and, in 
fact, this is what always happens. 


Theorem 1. Every quadratic extension of a field forms a field. 


Proof. Let F be a subfield of the real numbers, and let F(k) be a quadratic ex- 
tension of F. The associative, commutative, and distributive laws hold auto- 
matieally in F(k), because they hold for all real numbers. It is also easy to see 
that the numbers of the form x + y/k (z, y € F) are closed under addition and 
multiplication, that 0 is among them (—0 + 0 /k), and that —(x + y /k) is 
always a number of the same form. It remains only to verify that if 


z +yvk #0 
then 
1 


z+ yvk 


Of course we know that the reciprocal exists, because the real numbers form a 
field; the question is whether the reciprocal belongs to F (k). 


€ F(k). 


Lemma. If z+ y /k # 0, then z — yw/k = 0. 


Proof. If x — yk = 0, then z = yw/k. Here x and y cannot both be 0, be- 
cause x + ys/k would then be 0. Therefore neither x nor yis 0. Therefore Vk = 
x/y, which is impossible, because /k € F. 

By the lemma, we can write 


Se te mibi Pi an ens A 
zc yvVk zc yvVk z—yVk 
z—yvk 
| æ? — ky? 


ME x Zy 
mi" sy ey ade 


which belongs to F(k). 
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H z = z + yvk € F(k), 
then the conjugate Z of z is defined by the equation 
z= x — yvk. 


The operation of conjugation in a quadratic extension field is very closely analogous 
to the corresponding operation for the complex numbers. It may be worthwhile 
to review this, to bring out the analogy. 
Given a complex number 
z=r+y, 
we define 
$2. 
The basic properties of the operation z — Z are given in the following theorems. 
Theorem A. The conjugate of the sum is the sum of the conjugates. That is, if 


zı = 2% + yid Z2 = 2 + yet, 
then 
zi + 23 = 74 + Za. 
This is trivial to check. 


Theorem B. The conjugate of the product is the product of the conjugates. 


That is, 
2122 = 220. 


Verification. 1 
zizo = (£1 + yri) (z2 + ys) = tit2 — Vide + (x1y2 + tayi)t, 
ZZ. = (xi — yir. — Yot) = xivs — Y1Y2 — (ziya + z2y1)1- 


Obviously 2122 = 2122- 
By induction we get the following theorem. 


Theorem C. 2” = 2”. 


Theorem D. If a is a real number, then d = a. 
By a polynomial of degree n > 0, we mean (as usual) a function f defined by an 
equation 
fle) = ane” "asi P boo mg + 0 
where a, ~ 0. We allow also the “zero polynomial" which is 0 for every 2. 


Theorem E. If f is a polynomial with all coefficients real, then 


{@ =I@ 


for every z. 
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Proof. Let f(z) be as in the definition of a polynomial. Then 
S(Z) = an?” + a4 12"! + +++ + aZ + ao 
= UP Eg, 7 + +++ + TZ + Ap, 
by Theorem D. This is 
= UP d, az] +--+ 2 H Ap, 
by Theorem C. This is 
= ay + a.i 7l + +++ + Gz + do 
by Theorem B. And this is = f(z), by repeated applications of Theorem A. 
Theorem F. Let f be a polynomial with all coefficients real. If 
fo) = 0, 


fo) = 0. 


That is, if the coefficients are real, then the roots of the equation occur in con- 
jugate pairs 2o, Zo. 
The proof is trivial, because if f(zg) = 0 we have 


fzo) = Jeo) = 0 = 0. 


For quadratic extension fields F(k), we have a precisely similar sequence of 
theorems in which F acts like the field of real numbers, F(k) acts like the field 
of complex numbers, and 4/K acts like i. We shall merely restate the theorems and 
leave to the reader the easy task of verifying that the same proofs work in the same 
way. (The equations in the proof that z;2; = 2,2; take a slightly different form.) 
"Throughout these theorems it should be understood that F(k) is a quadratic ex- 
tension of F, and that conjugates are defined by 


zcyvk-—z— yv. 
Theorem 2. In F(k), the conjugate of the sum is the sum of the conjugates. 


then . 


Theorem 3. The conjugate of the product is the product of the conjugates. 


Theorem 4. 2" = 2", for every z € F(k). 
Theorem 5. If a € F, then 1 = a. 
Theorem 6. If f is a polynomial with all coefficients in F, then 


HORE! 
for every z in F(k). 
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Theorem 7. Let f be a polynomial with all coefficients in F. If z; € F(k), and 
f(zo) = 0, 
fo) = 0. 


Thus, for polynomial equations with coefficients in F, the roots in F(k) occur 

in conjugate pairs. 

Theorems F and 7 describe a phenomenon which has long been familiar from 
elementary work with quadratic equations. The roots of the equation 


then 


az? + bz --c — 0 
Qo mbs VUE dac. 
xs 2a 


are given by the formula 


If b? — 4ac = —d < 0, then the roots are the complex numbers 


bus udis 
3a " 9a 


which are conjugate, as predicted by Theorem F. 
Suppose now that a, b, and c are rational, and b? — 4ac = e > 0. The roots 


are then the real numbers 
—b 1 


These are conjugate elements of the quadratic extension field F(k) = Q(e), as 
predicted by Theorem 7. 


19.9 SURD FIELDS OF ORDER n; SURDS OF ORDER n 


Suppose that we have an ascending sequence of fields, starting with the rationals 
and proceeding by a quadratic extension at every step. Thus our fields are 


Pop Fi; 51 Fay 


where 
Fo =Q 


Fig1 = FG). 


Here, for each i, kj 41 is in F;, but Vki41 is not. In this case we say that Fn is a 
surd field of order n. 


and 


Theorem 1. All elements of F, are surds. 


Because they are obtainable from rational numbers by a finite number of opera- 
tion +,-, —, +, V . (For a formal proof, we would use induction: all elements of 
Fo are surds; and if F; has this property, so does F;41.) The converse is also true: 
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Theorem 2. Every surd belongs to a surd field of some order. 


Proof. Every surd x can be built from rational numbers by a finite number of 
operations +, +, —, +, vV . If nroot-extractions are needed in this process, we say 
that x is a surd of order n. Thus every rational number is a surd of order 0; 4/2 is 
a surd of order 1, and so on. 

Given that x is a surd of order n, let 


ki, ka, .., kn 


be the numbers whose roots we extracted in forming z, in the order in which these 
roots were extracted. Let 
Fy =Q, Fy = Folks), 
and in general 
Fişi = Fu). 


Between the ith root-extraction and the (i + 1)st, we may have used +, -, —, 
and +, but these operations can all be performed in F;. Therefore all the numbers 
formed in the intermediate stages are in the fields Fo, .. . , Fn; and in particular, 
x € Én, which was to be proved. 

Note that all of the indicated quadratic extensions are genuine; we really have 
V k;i41 € Fi, because if /k;, 4 € Fi, we could reduce the number of root extrac- 
tions used in forming z, and x would not be a surd of order n after all. 

Note also what Theorem 2 does not say: it does not say that there is one particu- 
lar F, which contains all surds of order n. In fact, the latter statement is not true; 
in general, F, depends not merely on the order n of x, buf also on z. For example, 
V2 and 4/3 are surds of order 1, but no one field F; contains both of them, be- 
cause 4/2 +- 4/3 is a surd of order 2. 
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1. Show that if nm is even and n is odd, then m is even. 
2. Show that 4/3 is irrational. 
3. Now prove the statement, made in the last sentence of this section. That is, prove 


that no quadratic extension Q(k) of the rational numbers contains both 4/2 and 4/3. 
19.10 APPLICATIONS TO CUBIC EQUATIONS WITH RATIONAL COEFFICIENTS 
Given a cubic equation 
f(z) = 2 + age” + aiz + ao = 0, 


where the coefficients are real numbers. We recall from the theory of equations 
that the polynomial on the left always has a factorization of the form 


g(2) = (z — 21)(2 — 29)(2 — z3) = 0, 


where 2}, z2, and zg are the roots. They may not all be different, and two of them 
may be complex. Why is it impossible for exactly one of them to be complex? 
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(For a full review of the algebraic background of this section, including proofs 
of all theorems cited here without proof, see Chapter 30 at the end of the book.) 
This gives 


g(z) = 22 — (z1 + za + 23)2? + (2122 + 2123 + 22%3)2 — Z12223 = 0. 


Now f(z) = g(z) for every z. The only way this can happen, for polynomials, is 
for the corresponding coefficients to be equal. Therefore, in particular, we have 


— (zı + 22 + 23) = a2. 


Suppose that a, b, and c are in a subfield F of the real numbers and that the 


equation 
2 +a +bz+c=0 


has a root 21, in a quadratic extension F (k) of F. Then 2; is also a root, by Theorem 
7 of the preceding section. If the third root is zs, then we have 


—(z + 2, + 23) = a, 
or 
zg = —(zı + 2 + a). 


Now z; +2, € F. (Why?) Therefore zg is in F. Thus we have the following 
theorem. 


Theorem 1. Given an equation, 
2 -F az? -- bz -- c — 0, 


with coefficients in a field F. If the equation has a root lying in a quadratie ex- 
tension of F, then the equation has a root lying in F. 


If the coefficients are rational, we can draw a stronger conclusion. 
Theorem 2. Given a cubic equation, 
z? -- az? -- bz 4- c — 0, 


where the coefficients are rational. If the equation has a root in the surd field, 
then it also has a rational root. 


Proof. Suppose that a surd z;, of order n, is a root. Then we have a sequence, 
Fo, F1, Bays «Fo 
of quadratic extensions, as in the preceding section, with 
Fo = Q, 


Fiji = Filki+1), 
for every 1, and z; € Fn- 


238 CONSTRUCTIONS WITH RULER AND COMPASS 
Now a, b, c € Q, so that our coefficients are in F,-1. By Theorem 1, our equa- 


tion has a root in Fa—ı. Repeating the same reasoning another n — 1 times, we 
conclude that our equation has a root in Fo. Since Fo = Q, this proves the theorem. 


19.11 THE TRISECTION OF THE ANGLE 


Some angles can be trisected with ruler and compass. For example, a right angle 
can be trisected: 
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Given a right angle ZA. Draw a circle with center at A, with any radius a, 
intersecting the sides of Z A in points B and C. Draw the circle with center at B, 
containing A. The two circles will interseet in two points, one of which will be 
a point D in the interior of ZA. (Both points of intersection will lie on the same 
side of AC as B; and exactly one of them will lie on the same side of AB as C.) 
Now AABD is equilateral, and hence equiangular. Therefore mZBAD = 60. 
Therefore mZ DAC = 90 — 60 = 30 = 4-90, and ZDAC is a trisector of 
ZBAC. To get the other trisector, we draw the circle with center at C, contain- 
ing A, and join A to the point where this circle intersects our first circle in the 
interior of ZBAC. 

For some angles—in particular, for angles of 60°—no such construction is pos- 
sible. The proof is as follows. 

The surd plane contains an angle of 60° because the surd plane contains an 
equilateral triangle. Now any angle can be “copied” with ruler and compass. (See 
Construction 7 of Section 17.1.) It follows that there is a 60° angle ZBAC, with 
AC as the positive x-axis and B in the upper half plane, as in the figure below. 


Y 


A G 3 


Figure 19.22 


THE TRISECTION OF THE ANGLE 239 


We have proved that the surd plane is a ruler-and-compass geometry. Surd 
lines and surd circles always intersect each other in the same way as the corre- 
sponding lines and circles in the complete plane. If there is a method of trisecting 
every angle with ruler and compass, then this general method must apply to ZBAC 
in particular; and the construction can be carried out in the surd plane. Thus we 
arrive at the following conclusions. 

(1) (?) The surd plane contains an angle ZDAC, with degree measure = 20. (?) 


Figure 19.23 


We shall show that this is impossible. It will follow that there is no general 
method for trisecting angles with ruler and compass. We have given that D is a 
point of the surd plane. Let F be the foot of the perpendicular from D to the z- 
axis. Then F is a surd point; the y-coordinate of F is 0, and the z-coordinate is the 
same as that of D. The distance between any two surd points is a surd. Therefore 
the number 


v= 4D 


is a surd. But — cos 20°. Thus we conclude that 
(2) (2) cos 20° is a surd. (?) 
We now need to do a little trigonometry, as follows: 


cos 30 = cos (20 + 4) 
= cos 20- cos 0 — sin 20 - sin 0 
= (cos? 0 — sin? 8) cos 0 — 2 sin 0 cos 8 sin 8 
= (2 cos? 6 — 1) cos 9 — 2(1 — cos? 0) cos 0 
= 4 cos? 0 — 3 cos 6. 
Setting 0 = 20, and recalling that cos 60° = 4, we get 
= 4cos* 20° — 3 cos 20°, or 8 cos? 20° — 6 cos 20° — 1 = 0. 
Thus we have 
(3) cos 20° is a root of the equation 


sy? — by —1 — 0. 
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Here we omit the question marks, because (3), unlike (1) and (2), is actually 
correct; it is not merely a statement whose consequences we propose to in- 
vestigate. 

Setting z = 2y, we conclude that 

(4) the number 4 cos 20° is a root of the equation 


z? — 3z— l= 0. 


We shall prove that no surd is a root of this cubic equation. This will mean 
that 3 cos 20° is not a surd, so that cos 20° is not a surd. This will mean that 
(2) is impossible, so that a ruler-and-compass trisection of a 60° angle is im- 
possible. 

It is easy to see that our cubic has no rational root. If p/q is a rational root, 
expressed in lowest terms, then each of the integers p and q is a divisor of 1 and 
hence is —1 or = —1. Therefore 1 and —1 are the only possible rational roots, 
and neither of them works. Therefore our cubic has no rational roots. By Theorem 
2, Section 19.10, it follows that no root of the equation is a surd. 

The impossibility of this classical problem is surprising to most people when they 
first hear of it. In fact, the world is full of well-intentioned people who simply 
refuse to believe it and go on trying to devise a method. It is easy to see why 
people feel this way. 

In the first place, the proof is much too hard to be capable of popularization. 
For this reason, amateurs are in no position to understand why it is that their 
enterprise is impossible. In the second place, an informal statement of the problem 
is misleading. It sounds as if we are saying that something or other “cannot be 
done”; and many times in the past, defeatist answers to questions like this have 
turned out to be false. For example, people said that flying machines could not be 
built and that matter could not be created or destroyed. People went on saying 
this sort of thing, with complete confidence, until the supposedly impossible things 
were done. If you think of the trisection problem in these terms, then you may 
believe that you can do it, if only you are more ingenious and more persistent than 
the people who have tried it and failed. 

But once the problem is given an exact formulation, the negative answer seems 
much more natural. In the surd plane, all ruler-and-compass constructions are 
possible. Therefore, if a figure can be constructed with ruler and compass, this 
means that in the surd plane, the figure exists. Thus, if angle trisection were al- 
ways possible, we would have the following theorem. 


(?) Theorem. In the surd plane, every angle has a trisector. 


If you think of the problem in these terms, then the negative answer is not sur- 
prising in the least. The proposed theorem describes a completeness property of 
the surd plane; it says that when you look for rays in certain places, you will find 
them. But since the surd plane is “all full of holes,” no kind of completeness con- 
dition has any plausibility until it is proved. Only the most incorrigible optimist 
would be surprised if he looked for a ray, in such a “geometry” as this, and failed 
to find one. 
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19.12 THE DUPLICATION OF THE CUBE 


Another of the impossible construction problems of antiquity is the duplication 
of the cube. Given a segment AB, we want to construct a segment CD such that 
a cube with CD as an edge has exactly twice the volume of a cube with AB as an 
edge. 

As before, if this construction is possible, it is possible in the surd plane. Sup- 
pose then that A and B are surd points. Granted that CD can be constructed, 
C and D must be &urd points. Thus the distances AB and CD are surds; and under 
the conditions of the problem we must have 


CD? = 24B, 


CD\ ou 
It follows that 


(1) (?) The equation zè? — 2 = 0 has at least one surd as a root. (?) 

But this is impossible. The only possible rational roots of this cubic are 1, —1, 
2, and —2. None of these work. Therefore no rational number is a root. There- 
fore no surd is a root. 


or 
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FROM EUDOXUS TO DEDEKIND 


20.1 PROPORTIONALITIES WITHOUT NUMBERS 


In the early chapters of this book we have distinguished between two different 
approaches to the concept of congruence for segments. In the metric approach, a 
distance function 

d: SX S—5R 


is given. Here the fundamental notion is that of distance: to each pair of points 
A, B there corresponds the real number AB, which is the distance between them. 
Congruence for segments is defined in terms of distance. The basic definition 
states that AB œ CD if AB = CD. The properties of congruence now become 
theorems proved on the basis of the metric definition. 

The alternative purely synthetic approach takes congruence for segments as & 
basic idea, left undefined, and governed by certain postulates. In this treatment, 
the idea of distance does not appear at all; in fact, the only numbers that appear 
are the natural numbers 1, 2, .... We may think of congruence for segments as 
the idea f “same distance.” 

The treatment of similarity, in this book, has been strictly metric. We recall 
that a correspondence 


ABC e DEF 


is called a similarity if corresponding angles are congruent and the lengths of cor- 
responding sides are proportional. 


B E 
c a f. d 
A b C D e F 
ZAzZD, LB=ZE, £C=ZF, 
a, b, c ~d, e, f. 
FicunE 20.1 


It is very easy, of course, to treat angle congruence in a purely synthetic way; 
we have discussed this in Chapter 8. But proportionality seems to be quite another 
242 
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matter. We defined the expression a, b, c ~ d, e, f to mean that 
t 


Snc o 


oie 


Here the indicated divisions make sense, because a, b, c, d, e, and f are positive 
real numbers. We say that the sides of AABC and ADEF are proportional if 
the lengths of these sides are proportional in the sense that we have just defined. 

This treatment of similarity is by now very nearly universal, even in books 
which use a strictly Euclidean approach insofar as practicality permits. In nearly 
every elementary book on “synthetic” geometry there is a page on which the idea 
of distance is introduced, in order to permit a metric treatment of proportionality. 

Indeed, if you are not allowed to measure distances and perform divisions, it 
is not so easy to see how you could even explain what is meant by proportionality 
for segments, let alone prove anything about how proportionalities work. Never- 
theless, this can be done, using no numbers at all except the positive integers. The 
method is used in Euclid’s Elements; and the mathematical ideas that make it 
work are attributed to Eudoxus. This suggests two questions. 

(1) What were the purely synthetic ideas that Eudoxus used as a substitute for 
algebra? 

(2) Given a purely synthetic geometry, how can we define a distance function 
satisfying the ruler postulate? 

The first of these questions may seem to be of purely historical interest. But 
this is hardly true of the second: we need to answer it, to introduce coordinate 
systems. Moreover, it turns out that the two questions are so closely related that 
if you answer one of them, the other becomes easy. 

And the ideas of Eudoxus took on a new importance in the nineteenth century, 
when Richard Dedekind found, to everybody's astonishment, that they were 
exactly what was needed in setting up the foundations of the real number system. 
For these reasons, the purposes of the present chapter are only incidentally ar- 


chaeological. 


20.2 EUDOXUS' SYNTHETIC DEFINITION OF PROPORTIONALITY 


We shall work our way gradually toward Eudoxus’ idea, starting with the metric 
concept which we know, and gradually stripping it of its algebraic apparatus. 
To start, we define the expression 


AB:CD::EF : GH (1) 
to mean that 
AB, CD ~ EF, GH; (2) 
that is, 
EF GH 
Ai a LR 3 
i a (3) 
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The first of these expressions is pronounced “AB is to CD as EF is to GH.” We 
propose to find a way to define this expression without mentioning any numbers 
except positive integers. Of course, it can easily happen that the segments AB and 
EF are incommensurable. In this case, the proportionality constant 


_ EF _ GH 
AB OD 
will be irrational. Our first step toward Euclid will be to express Condition (3) in 
the form of a statement about rational numbers. In the light of the comparison 
theorem, this can be done as follows. 
(4) If p/q is rational and 


p 
then 1 
p 


Conversely, if the second of these inequalities holds, then so does the first. This in 
turn can be expressed without mentioning division. 
(5) If p and q are positive integers, and 


p: AB < q- EF, 
p:CD < q- GH. 


then 


And conversely, if the second of these inequalities holds, then so does the first. 
We are now almost done, because (5) is very close to being a statement about 
segments, We recall that addition can be defined for congruence classes of segments. 
For each segment XY, we let [XY] be the set of all segments that are congruent to 
XY. Given any two segments AB and CD, we take points X, Y, Z such that 


X-Y-Z, AB&XY, CD=Yz; 
and we then define the sum [AB] + [CD] by the formula 
[AB] + [CD] = [XZ]. 


We showed, in Chapter 8, that the sum 1 of the two congruence classes really de- 
pends only on the congruence classes [AB] | and [CD], and is independent of the 
choice of A, B, C, D, X, Y, and Z. When AB c CD, we write 


[AB] + [AB] = 2 - [4B]. 
And for any positive integer n, we use the shorthand 
n{AB] = [AB] + [AB] + --- + [AB]. (ton terms). 


Thus, if we write LA 
n{AB] = [XZ], 
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this means that if you take n | congruent copies of AB and lay them end to end, you 
get a segment congruent to XZ. 
In Chapter 7, we defined the expression 


AB < CD 
to mean that there is à point B’, between C and D, such that AB = CB’. This, 
of course, is a purely synthetic way of conveying the idea that AB < CD. 


We are now ready to give Eudoxus’ formulation of (5). Tt reads as follows. 
(6) Let p and q be any positive integers. If 


p{AB] < EF), 
then 3m ux 
pD) < [GH]. 


And conversely, if the second of these inequalities holds, then so does the first. 
This was Eudoxus’ working definition of the statement 


AB:CD: : EF : GH. 


Euclid used it, throughout the Elements, every time he had to deal with propor- 
tionality (except in the commensurable case). This was a really extraordinary 
tour de force, because even the simplest theorems under this scheme become rather 
formidable. Every proportionality that we write down becomes a very complicated 
statement about what happens when congruent segments are laid end to end. 

For example, Proposition 4 of Book V of the Elements reads as follows. 


Propositon 4. “Tf a first magnitude has to a second the same ratio as a third to 
a fourth, any equimultiples whatever of the first and third will also have the 
same ratio to any equimultiple whatever of the second and fourth respectively, 


taken in corresponding order."' 
We can rewrite this in the following form. 


TS 


Proposition 4’. If AB :CD:: EF : GH, and m and nare any positive integers, 


then hai =e: i OE 
mAB :mEF : :nCD : nGH. 


Here mAB denotes any segment in the class mA B), and so on. 
The corresponding algebraic theorem is very simple. It says that if 
AB,CD ~ EF, GH, 
and m and n are any positive integers, then 
mAB, mEF ~ nCD, nGH. 


Writing these proportionalities as equations between fractions, we get the follow- 
ing proposition. 
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Proposition 4". If 
ER _ GH 
AB = UD 
and m and n are positive integers, then 
nCD _ nGH 
mAB  mEF 


The first of the equations means that 
EF -CD = AB- GH; 
and the second means that 
mnEF -CD = mnAB - GH. 


Thus the “proof” doesn’t amount to much. All the theorems in Book V of the | 
Elements evaporate in the same way, as soon as they are interpreted algebraically. 
This, however, does not mean that Euclid was being silly. The point is that in 
Euclid’s time the algebra of the real numbers had not yet been discovered. Getting 
along without it in the study of geometry was a formidable achievement indeed. 

We observed, when we were regarding congruence synthetically, that we could 
not talk about distance; congruence involves only the idea of same distance. Much 
the same thing happens if we use a purely synthetic treatment of proportionality. 
We cannot speak of the ratio AB : CD. All we can say is that AB : CD and EF : GH 
are the same ratio. The easiest way to see this is to observe that the ratio AB : CD 
would have to be a real number; and in purely synthetic geometry the only num- 
bers are the positive integers. 

Finally, we should confess that in giving what are supposed to be Euclid’s for- 
mulations of certain ideas, we have made no attempt to copy his literary style. 
In the Heath translation, his definition of proportionality is as follows. 

“Magnitudes are said to be in the same ratio, the first to the second and the third 
to the fourth, when, if any equimultiples whatever be taken of the first and third, 
and any equimultiples whatever of the second and fourth, the former equimultiples 
alike exceed, are alike equal to, or alike fall short of, the latter equimultiples 
respectively taken in corresponding order.” 

This is the statement which we gave above, in a rewritten form, as Condition (6). 

Our account of Book V has also been simplified in other ways. For one thing, 
Euclid also gave a synthetic formulation of the statement that the ratio AB:CD 
is less than the ratio EF : GH. For another thing, Euclid did not use, even tacitly, 
the Archimedean postulate. He merely provided that segments had to behave in 
an Archimedean fashion for one to be able to talk about proportionalities between 
them. That is, to write Lp 

AB :CD :: EF : GH, 
you must first know that 
(1) »[AB| > (CD) for some p, 
(2) [CD] > [AB] for some q, 
(3) r[EF] > [GH] for some r, and 
(4) SGH] > [EF] for some s. 


THE ALGEBRA OF SEGMENT ADDITION 247 


At no point did Euclid commit himself on the question whether these conditions 
held for any two given segments; he merely announced that he was not going to 
talk about ratios except in the cases where they did hold. The resulting treatment 
was subtle and delicate in the extreme. The fact that such a program was carried 
out, with only occasional slips in matters of detail, reminds us that the Greeks 
were no more primitive in mathematics than they were in the arts. 
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The following is a selection of propositions from Book V of the Elements. Interpret 
each of these propositions algebraically, and prove the resulting theorem. This problem 
set is designed to convince you that the language of mathematics has progressed in im- 
portant ways. 

Proposition 5. “If a magnitude be the same multiple of a magnitude that a part sub- 

tracted is of a part subtracted, the remainder will also be the same multiple of the re- 

mainder that the whole is of the whole.” 


Proposition 6. “If two magnitudes be equimultiples of two magnitudes, and any mag- 
nitudes subtracted from them be equimultiples of the same, the remainders also are 
either equal to the same or equimultiples of them.” 

Proposition 9. “Magnitudes which have the same ratio to the same are equal to one 
another; and magnitudes to which the same has the same ratio are equal.” 

Proposition 15. “Parts have the same ratio as the same multiples of them taken in 
corresponding order.” 

Proposition 19. “If, as a whole is to & whole, so is a part subtracted to a part sub- 
tracted, the remainder will also be to the remainder as whole to whole.” 


Proposition 24. “If a first magnitude has to a second the same ratio as a third has to 
a fourth, and also a fifth has to the second the same ratio as a sixth to the fourth, the 
first and fifth added together will have to the second the same ratio as the third and 
sixth have to the fourth.” 

Proposition 25. “If four magnitudes be proportional, the greatest and the least are 
greater than the remaining two.” 


(This last one is very ambiguous. You should try to find an interpretation that makes 
it true.) 


20.3 THE ALGEBRA OF SEGMENT ADDITION 


We shall not give a full development of the Euclidean theory of proportion, 
but we shall need to know some of the simplest facts about what happens when we 


lay segments end to end. 
Theorem 1. The Commutative Law. 
[4B] + (CD) = (CD) + | AB}. 


This follows from the definition of segment addition. The order in which the 
segments were named never even seemed to matter; the same is true of the next 


theorem. 
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Theorem 2. The Associative Law. 
(LAB] + (CD]) + [EF] = [AB] + ((CD] + [EF]). 


Theorem 3. The Distributive Law. For every positive integer n, 
n([AB] + [CD]) = n[AB] + n{CD]. 


Proof. For each n, let P, be the proposition given by the formula. Then P; is 
true. To prove the theorem by induction, we need to show that if P, is true, then 
so also is P4441. Now 


(n + 1)((AB] + [CD]) = n((AB] + (CDI) + ((4B] + (CDI) 
= (n[AB] + n[CD]) + ((4B] + [CD]) 
= (n{AB] + nCD]) + ((CD] + [4B]) 
= n[AB] + ((n[CD] + [CD]) + [AB] 
= n[AB] + ((n + 1) [CD] + [4B]) 
= n[AB] + ((4B] + (n + 1)[CD)) 
= (n[AB] + [AB]) + (n + [CDI 
= (n + DIAB] + (n + ICD). 

What is the reason for each step? 


Theorem 4. Preservation of Order. If 


[AB] > [CD], 
then 

n[AB] > n[CD] 
for every positive integer n. 


Proof. If [AB] > [CD], 
then Sai 
[AB] = [CD] + [EF] 


for some segment HF. (Recall the definition of > for congruence classes.) By the 
preceding theorem, set 3 
n{AB] = n{CD] + n[EF], 


which is > n[CD], as desired. 
Theorem 5. If EE 
n[AB] > n[CD], 


for some n, then E 
[AB] > [CD]. 
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(if not, we would have a contradiction of the preceding theorem.) It is also 
easy to see the following theorem. 


Theorem 6. If A-B-C, then 
n[AC] > n{AB] 
for every n. 


The reason is that if A-B-C, it follows that [AC] > [AB]. 


20.4 HOW TO DEFINE RATIOS: THE SUPREMUM 


We have observed that in the Euclidean theory of proportionality, we cannot 

talk about ratios; we can only talk about the relation of same ratio. The relation 
AB :CD::EF : GH 

says that AB and CD are in the same ratio as EF and GH; but the “ratios” AB:CD 
and EP : GH have no meaning at all when they stand alone. If the real number 
system is available, however, we can assign a meaning to AB : CD without using 
metric geometry. As a guide in framing the definition, let us recall that the ratio 
should turn out to be the number AB/CD. Thus, by the comparison theorem, we 
can say 


: « AB:C a) 
if and only if 
P< a. (2) 
This is equivalent to 
pCD < GAB, (3) 
which in turn is equivalent to 
pD) < AB). (4) 


Thus p/q < AB :CD if and only if p copies of (CD), | id end to end, form a seg- 
ment shorter than a segment formed by q copies of [CD], laid end to end. 
Let 


K- fe | pCD) < AB} 


Officially, this set of rational numbers has been defined synthetically, without, 
reference to distance. Unofficially, we observe that in the metric scheme it must be 


true that 
_ {ple < ABl. 
x-(2|2< op 
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The relation between the set K and the number AB/CD is simple; it can be con- 
veyed by the two conditions that follow. 

(1) AB/CD is an upper bound of K. That is, every element of K is € AB/CD. 

(In fact, every element of K is strictly less than AB/CD. But this stronger 
condition is not required in the general definition of an upper bound of a set of 
numbers.) 

(2) AB/CD is the least of all the upper bounds of K. That is, every other upper 
bound of K is greater than AB/CD. 

If a set K of numbers and a number s are related in this way, we write 


s = sup K, 


and we say that s is the supremum, or the least upper bound of K. To repeat: 
s = sup K if (1) s is an upper bound of K, and (2) no number less than s has 
this property. 

More examples of this follow. Let 


Ki= L n à positive integer] . 


Here 
sup K, = 1; 


and sup K, belongs to K,. On the other hand, if 


Ki = b — i n à positive integer , 
then 


sup Ko = 2; 


and sup Ke does not belong to Ks. If Kg is the set N of all positive integers, then 
there is no such thing as sup K3, because Kg has no upper bounds at all. 

To define ratios of segments, in terms of Euclid's scheme, we need the following 
two basic postulates, one dealing with geometry and the other dealing with the 
real number system. 


Tue AncHIMEDEAN PosrULATE. Given any two segments AB and CD, there is 
a positive integer n such that 


n[AB] > [CD]. 


(We recall that Euclid's theory of proportionality was restricted to pairs of 
segments that behaved in this way.) 


Tur DEpEKiND PosruLATE. Given a nonempty set K of real numbers. If K 
has an upper bound, then K has a supremum sup K. 


This is our final and crucial postulate for the real number system. To indicate 
what it means, as a completeness condition, we shall explain why it fails to hold 
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in the rational number system. Consider, for example, the set 
K= fe 
q 


This can be described purely in terms of rational numbers: 


p,q >0 and ? « val. 


2 
UD D 
x - |» o and 2 <2} 


In the real number system, K has a least upper bound sup K = /2. But K has 
no sup in the rational number system. The reason is that if r/s is an upper bound 
of K, and 
t T 
V2 < a < 3 

then £/u is also an upper bound of K. Thus no rational upper bound of K is smaller 
than all other rational upper bounds of K. 

In the same way, if we delete from the real number system any one number z, 
the resulting set R’ of numbers does not satisfy the Dedekind postulate. The 
reason is that in the reduced system R’, the set 


K = (yy < 3) 


has many upper bounds, but none of them is a least upper bound. (If zis an upper 
bound, then any number between z and z is an upper bound.) 
Given two segments AB, CD. Let 


K- t | p(CD] < Bi} 
On the basis of the Archimedean postulate, we shall prove the following theorems. 


Theorem 1. K contains at least one positive number p/q. 


Theorem 2. K has an upper bound. 
From these two theorems it will follow, by the Dedekind postulate, that: 


Theorem 3. K has a least upper bound sup K, and sup K > 0. 
These theorems will justify the following definition: 


DEFINITION. i AGE ae 
AB :CD = sup K; 


And we shall then know by Theorem 3 that: 


S| 


P! 


AB: 


> 


Theorem 4. For every two segments AB, C 
> 


Sl è 
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Let: us now prove Theorems 1 and 2. 


Proof of Theorem 1. Given AB and CD. By the Archimedean postulate, there 
is an n, such that Mok my 
n{AB] > [CD]. 
Let p = 1 and let g = n. Then p/q is positive, and belongs to K. 


Proof of Theorem 2. Given AB and CD. By the Archimedean postulate, there 
is a number n, such that yi a ES 
n|CD] > [AB]. (1) 


We assert that n is anupper bound of K. We shall prove this by showing that if 
p/q > n, then p/q does not belong to K. 
If p/q > n, then p > nq. Therefore 


pICD] > nglCD]. 
It follows from (1), by Theorem 4, Section 20.3, that 


ndCD| > AB). 
"Therefore E ih 
pICD|] > q[AB], 


so that p/q does not belong to K. 
To see where the following theorem comes from, let us first state its metric form, 
which is trivial. 


Theorem. If A ~ B, and C-D-E, then 


CE _ OD, DE. 
AB AB ' AB 


The reason is that CD + DE = CE. 
The corresponding synthetic theorem is not trivial. 


Theorem 5. The Addition Theorem. If A »* B, and C-D-E, then 
CE : AB = CD: AB + DE: AB. 


The proof is both long and tricky. It will be given in the next section. 


20.5 PROOF OF THE ADDITION THEOREM 


Obviously under our definition of ratios, the addition theorem is a statement 
about sup’s. To prove the theorem, we must first interpret it in terms of the 
definition. 
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Given A # B and C-D-E. Let 


Kı = fe | p(AB] < aD} 


so that Sad 

CD : AB = sup K; = kı. 
Let 

Kee [ r[AB] < «DE) " 
so that 


Finally, let 
K= (t| (AB) « (EC) 
so that D s 
CE: AB = sup K = k. 
The addition theorem states that 
ky + ke = E: 


The proof is rather long. We have tried to make it as easy as possible by split- 
ting it up into a series of subsidiary theorems, each of which has a rather short 
proof. You are warned, however, that unless you have encountered this sort of 
proof before, it is going to be confusing. On the other hand, the proof will repay à 
great deal of study, because the ideas that come up in it are quite fundamental in 
analysis. 

First we shall state a theorem. 


Theorem 1. If p/q belongs to K, and r/s < p/4, then r/s belongs to K. 
Proof. Given LE ie c 
p{AB] < [CE] 
and 
qr < ps, 


we need to show that m e 
1[AB] < s{CE]. 


By Theorem 4, Section 20.3, we have 


pr|AB] < qriCE], 
and de DAE 

qrCE] < psiCE], 
because qr « ps. Therefore A Zr 

pr(AB] < psiC E]. 


Now if it were true that r[AB] = s[CE], then we would have pr[AB] = ps(CE]; 


and if r[AB] > s(CE], then priAB] > ps[CE]. In each case, the consequence is 
false. 
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Therefore 
r[AB] < s[CE], 

which was to be proved. 

Thus the set K is of a very special type, called a cut in the rational numbers. 
To be exact, a set Z is called a cut in the rational numbers if 

(1) Z is a set of positive rational numbers, 

(2) Z is not empty, 

(3) Z has an upper bound, and 

(4) if à 

o<2<f; 
qi 8 

and r/s belongs to Z, then p/q belongs to Z. 

In this language, we can state Theorem 1 in the following form. 


Theorem 2. If AB and CE are any two segments, and 


K= fe | plAB] < c) 


then K is a cut in the rational numbers. 


Theorem 1 tells us that K satisfies (4), and we know already that K satisfied 
the other three conditions of the definition. 
The following theorem is logically trivial, but very useful. 


Theorem 3. If z = sup Z, and e is any positive number, then some element z 
of Z is greater than z — e. 


Proof. If there were no such z, then every element z of Z would be Sz — e. There- 
fore z — e would be an upper bound of Z. This is impossible, because sup Z is the 
smallest of all of the upper bounds of Z. 


Combining Theorems 2 and 3, we get a simple description of the set K. 


Theorem 4. Let AB and DE be segments. Let 


k= [e| nazi < DB} 
Let 


k — sup K. 
Then 


k 


That is, K contains all positive rational numbers that are less than k. 
Proof. Let r/s be a rational number between 0 and k. Let 


o<t <a] cK, 


e-k-—t. 
8 
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Then e > 0. By Theorem 3 there is an element p/q of K such that 


: >k—e. 
This means that 


By Theorem 1, r/s belongs to K, which was to be proved. Thus we can write 


p p | 

fplo<2<k Cc Ky, 
t 
s 
{i 


We now return to our geometry. 


0 «5 «x ck. 


Es 


t 
Jo<tcilcx. 


Theorem 5. If p/q belongs to Kı and r/s belongs to K3 then p/q + r/s belongs 
to K. 


Proof. We have given that 


plAB] < (CD) 
and e ux 
r[4B] < s[DE]. 
Obviously 
pr. gr ps 
q viam qs 
Thus we need to prove that 
(qr + ps)[AB] < as(CE]. 
Now P 
qr(AB] < asl DE] 
and 


ps[AB] < qs{CD), 
by Theorem 4, Section 20.3. Therefore 
(qr + po) LAB] < as([CD] + (DE) = gF], 
which was to be proved. 


Theorem 6. If p/q does not belong to Kı, and r/s does not belong to K2, then 
p/q + r/s does not belong to K. 


256 FROM EUDOXUS TO DEDEKIND 


Proof. Given 
p[AB] = [0D] 
and DI 
r[AB] = s[DE], 
we need to show that 


(ar + ps)[AB] = qs{CE). 
The proof is very much like that of the preceding theorem. First, 
qr(AB] = qs[DE] 


and ERA 
ps{AB] = qs[CD], 


by Theorem 4, Section 20.3. Therefore 


(ar + ps)[AB] = qs((CD] + (DE) 


= qs{CE], 
which was to be proved. 


We are now ready to finish the proof of the addition theorem. In the notation 
of this section, the theorem says that 


ky + ko = k. 
If this is false, then either 

ki the  k (1) 
or 

ki + kg < k. (2) 


We shall show that both of these are impossible. 
If (1) holds, then ky + ka — k > 0. Let 


e= ki + k — k, 
so that 
k= ky + ke >g, 
By Theorem 3, there is a p/q in K, such that 
p La 
Q^ 2' 
and there is an r/s in K3 such that 
r e 
ets k 
It follows that 
Sipe ki + ka — e. 


Therefore p/q + r/s does not lie in K. But by Theorem 5, p/q + r/s must lie in 
K. This gives a contradiction, showing that (1) is impossible. 
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If (2) holds, then k — ky — ko > 0. Let 


e = k — kı — kz; 
so that 
k = kı + ka +e. 


Let p/q and r/s be rational numbers such that 


kı <Pch+§ 

and 
r e 
ko tees Sut 


Then p/q does not belong to Kı, and r/s does not belong to Kz. By Theorem 6, 
p/q + r/s does not belong to K. Since 


t t 
ftlo<t<cx, 
it follows that Sue 
q'se k. 


But this is impossible. Adding our previous inequalities for p/q and r/s, we get 


Ades ki É+ ka t5 = ki F ka tem k. 
This gives a contradiction, showing that (2) is impossible. Therefore the addition 


theorem is true. 

You should check back over the whole proof, with a view to finding out whether 
any of the steps can be left out. (All the steps looked necessary to the author, at 
the time when the above was written.) ; 


20.6 THE METRIZATION THEOREM 


In this book, we have considered Euclidean geometry from two viewpoints, the 
metric and the purely synthetic. In the metric approach, the postulates tell us 
a great deal about the relation between our geometry and the real number system: 
the structure is . 

[S, £, 6, d, m]. 
where d is the distance function 
d:SXxSoR. 


The distance function obeys the ruler postulate; and congruence and betweenness 
are defined in terms of distance. 
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In the purely synthetic approach, which Euclid used, the real number system is — 
nowhere mentioned. The structure is E 


[S, £, 6, =, G]. 


Here & and @ are undefined relations of congruence and betweenness, subject to 
the postulates stated in Chapter 8; and the only numbers that get used are: the 
positive integers, which are used to count things. t 
The first really big step in geometry after the Greeks was the invention of co- 
ordinate systems, by René Descartes. Obviously, to set up a coordinate system 
in the plane (or in space), you must have a distance function. In fact, to label the 
points of the z-axis with numbers, you must have gotten, from somewhere, à CO- — 
ordinate system for the z-axis, satisfying the conditions of the ruler postulate. 
We shall now show that in a purely synthetic geometric system we can define & — 
distance function satisfying the metric postulates. To be exact, we shall prove the 
following theorem. t 


Theorem 1. Given a geometric structure 
[S, £, F, =; 8), 


satisfying the postulates of Chapter 8, and satisfying the Archimedean postu- - 
late. Let A and B be any two points. Then there is a function 


d:SxS—R, 

such that 

(1) d satisfies the ruler postulate, 

(2) CD = EF if and only if the distances CD and EF are the same; 

(3) C-D-E if and only if CD -- DE — CE, and 

(4) AB — 1. 

That is, we can always define a distance function which gives us back the same 
congruence relation for segments and the same betweenness relation for points 
that we had before; and we can always set up our distance function in such a way f 
that any given segment AB is the “unit of length.” | 

When we say that d “satisfies the ruler postulate,” we mean that for any line L ~ 
there is a coordinate function 

[LR 


of L into the real numbers, such that if x = f(P) and y = f(Q), then 
PQ = |e — yl. 


It is not claimed that every real number z is the coordinate f(P) of some point P; 
that is, we do not claim that all of the real numbers get used as coordinates. In 
fact the latter statement cannot be proved on the basis of the synthetic postulates 
that we have stated. (The easiest way to see this is to observe that the surd plane - 
satisfies the synthetic postulates; and in the surd plane, only the surds get used as 
coordinates.) If the coordinate functions are one-to-one correspondences f: L €» R, 


THE METRIZATION THEOREM 259 


then the geometric structure [S, £, €, =, $] is called complete in the sense of Dede- 
kind. 

We now proceed to the proof. Fortunately, the hard part of the proof, which is 
to set up our distance function in such a way that any given segment AB is the 
“unit of length,” is already over with. For any two points P, Q, let 


PQ = d(P,Q) = PQ: AB, 


where PQ: AB is the ratio defined and studied earlier in this chapter. Since the 
ratio was defined, in the first place, in terms of congruence classes [PQ], we know 
immediately that (2) is satisfied. 

To prove (3), we first observe that if C-D-E, it follows by the addition theorem 
that 

CE: AB = CD:AB + DE: AB. 
Therefore 
CE = CD + DE, 


by our definition of distance. (This is the first of the two things that the addition 
theorem is good for.) Suppose, conversely, that 


- CE = CD + DE, (1) 


where C, D, and E are all different. If it were true that D-E-C, then it would 
follow that 

DC = DE+ EC, 
80 that 

CE = CD — DE; (2) 
and from (1) and (2), we get 


CD + DE = CD — DE, 


or 2DE = 0, or DE = 0. Therefore D = E, which is a contradiction. If we 
suppose that D-C-E, this leads to a contradiction in the same way. But one of 
the three points must be between the other two. Since D-E-C and D-C-E are both 


false, it follows that C-D-E must be true. 
Also, (4) is satisfied. Obviously 


pAB] < AB] 
precisely if p < q, which means that p/q < 1. Therefore 
I: = mp [|o <2 <1)=1 
and d(A, B) = 1. 


It remains only to check the ruler postulate. Given a line L, and three points 
C, D, E of L such that C-D-E (Fig. 20.2). 
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Cort DP? E 
t — — 9—  ]— e—_ —.— —» 
Figure 20.2 
We shall set up a function 
: FLmR, 


and show that f is a coordinate system satisfying the ruler postulate. 
If P belongs to the ray DB, let 


J(By = DP. 
We shall check that if P and Q are points of DË, with coordinates x, y, then 
PQ = |z = yl; 


as required in the ruler postulate. If either P or Q is — D, this is easy to see. 
For example, if Q — D, then 


PQ = DP = f(P) = z= |e| = |x — 0. 


If P = Q, it is also trivial, because then PQ = 0 = |x — z|. Suppose then, 
finally, that P, Q, and D are all different. Then either D-P-Q or D-Q-P. If D-P-Q, 
then 

DQ: AB = DP: AB + PQ: AB, 


by the addition theorem. (This is the second of the two things for which the addi- 
tion theorem is useful.) In terms of distance, this tells us that 


DQ = DP + PQ, 
or 


PQ = DQ — DP=y-x. 
Here y — x > 0, because y — x = PQ. Therefore 


PQ = |z dh yl, 
which was to be proved. 


We now define the coordinate function f, for points P of the opposite ray T6, 
by the condition 
f(P) = —DP. 
We now have 
P Q = |z 7€ yl, 


if P and Q both belong to De. (The proof is almost exactly the same as for the ray 
DË: the point is that |z — y| is unchanged when the signs of z and y are reversed.) 
If P belongs to DC and Q belongs to DË, then 


PQ=PD+DQ, z--—PD, y= DQ. 
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Therefore 


ll 


PQ 


Yi — E 
l — yl, 


y 


which was to be proved. 

The reader will observe that the proof of the metrization theorem requires all 
of the apparatus set up in the present chapter. It is plain that such proofs as this 
do not form a part of elementary mathematics. Nevertheless it is quite usual, in 
elementary geometry courses, to give some sort of “indication of proof” for what 
amounts to a metrization theorem. Usually this is done just before the treatment 
of similarity, so as to permit an algebraic treatment of proportionality. Usually 
the same discussion is repeated, when coordinate systems are introduced. The 
reader should now be able to judge the extent to which these “indications of proof” 
are masterpieces of expository terseness. 


20.7 THE DEDEKIND CUT 


If the ring Z of integers is given, it is not very hard to set up the ring Q of ra- 
tional numbers. To pass from the rational numbers to the real numbers is another 
matter. This, however, is what we need to do, to show that there really is a number 
system which satisfies the field postulates of Chapter 1 and also the Dedekind 


postulate. 
One of the most elegant approaches to this problem was devised by Dedekind 


himself, leaning heavily on the ideas of Eudoxus. For the sake of convenience, we 
shall restrict ourselves to positive numbers. (Once we have them, it is not hard to 
set up their negatives.) 

We recall that in Section 20.4, we defined 


AB:CD = sup K, 
where 


K = [2 | eb) < aa8]- 


It appeared in Section 20.5 that K was always a cut in the rational numbers. That 
is: 

(1) K is a set of rational numbers. 

(2) K is not empty. 

(3) K has an upper bound. 

(4) If 0 < p/q < r/s, and r/s € K, then p/q € K. 

If the real number system R is regarded as given, then to every cut K there cor- 
responds a unique positive real number sup K. On the other hand, cuts are de- 
fined purely in terms of the rational number system Q. We can use this fact to 
define a set of objects which can be regarded as the positive reals. 


Derinition. A positive real number is a cut in the positive rationals. 


Let R* be the set of all cuts. In Rt we need to define addition, multiplication, 
and order. 
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(1) If K and L are cuts, then 
K+L= @+yle €K,y EL. 
(2 K-L = {ylz € K, y € L}. 


We need to show, of course, that the sum and product of any two cuts are 
cuts. It is then easy to check that R* satisfies the usual algebraic identities gi 
in the field postulates. For example, 


K+L=L+ K, 
because 
{z + ylz E€ K, y E L} = {y + zly € L,x e Kj. 


(3) We define K < L to mean that K is a proper subset of L, that is, K C L 
and K # L. 

Under this definition, when we say that K is an upper bound of Z, this m 
simply that every cut belonging to Z is a subset of K. It is now refreshingly easy 
to see that every bounded set of cuts has a supremum: sup Z is simply the union. 
of all of the sets belonging to Z. This forms a cut; it is an upper bound of Z; and. 
no smaller cut is an upper bound of Z. 

The above discussion is, of course, merely a sketch. For a relentless pursuit t 
all the details, see the latter portion of E. Landau's Foundations of Analysis. : 

A rather good case can be made out for speaking not of the Dedekind cut but of 
the Eudoxian cut. The crux of Dedekind’s procedure was to use cuts as a working 
definition of real numbers; and at bottom this is precisely the procedure that 
Eudoxus had used, over two thousand years before. 


aire 


LENGTH AND PLANE AREA 


21.1 THE DEFINITION OF ARC LENGTH 


Given an are AB of a circle C: 


Figure 21.1 


We take a sequence of points 
A = Ao, Åi, A2,.--, An = B, 


in the order from A to B on the arc; and for each pair of successive points A;—, 
A; we draw the segment A;_1 Ai, as indicated in the figure. The union of these 
segments is called an inscribed broken line; and the sum of their lengths is denoted 
by pn. Thus 


pn = AoA Aida +: + An—1An 


n 
= >) AcaAe 
i=l 
There are now various ways that we might define the length of the arc AB. If 
we merely want to state a definition, as a matter of form, without intending ever 
to apply it, then our problem is simple. We agree to use equally spaced points 
Ao, A1,-++,An- The length pa of our broken line is now completely determined 
by n, and we can define the length to be 
p = lim pn. 

no 
To justify this, we would need to explain what is meant by lim, s, and we would 
have to show that the indicated limit really exists, for every circular arc. 
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The following definition, however, is far more manageable. Let P be the set 
of all numbers p, which are lengths of broken lines inscribed in AB. Thus 


P'a fon Pn = 9x Aia]: 


p = sup P. 


Let 


To justify this, we need to prove the following theorem. 


Theorem 1. P has an upper bound. 


It will then follow, of course, that P has a least upper bound sup P. The proof 
is easy on the basis of the following preliminary result. Let APQR be an isosceles 
triangle, with PQ — PR. 


p 


Figure 21.2 


We assert that if P-Q-S and P-R-T, then 
ST > QR. 


Suppose that PS < PT, as in the figure, and take U between R and T so that 
SU || QR. Then 


SU. P8 
2 1. 
QR i 
Therefore i 
SU > QR. 


We shall now show that ST > SU. Evidently Zl is acute, because Z1 is & 
base angle of an isosceles triangle. Therefore Z2 is obtuse. Therefore Z.3 is acute. 
Therefore mZ3 < m2. Therefore ST > SU. (Why?) 

We now return to our circular are, Draw any square that contains the whole 
circle in its interior (Fig. 21.3). We project each point Ai onto the square, as in- 
dicated in the figure. That is, A7 is the point where DA, intersects the square. 
Then A;_1A; < Aj_1Aj. Therefore p, is always less than 7%, A/_,Aj. There- 
fore p, is always less than the perimeter of the square. Thus the perimeter of the 
square is the upper bound thet we are looking for. 
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Ficure 21.3 


This justifies our definition 
p — sup P. 
Of course a circle is not an arc of itself, under our definition of an are. But we 
can define the cireumference of a circle in an entirely analogous way; by setting 
up an inscribed polygon with vertices 


Ao, Ai, ...3 An-1, An = Ao. ^ d 


We then let p, be the perimeter 
jx Aiii, 
ici 

let P be the set of all such perimeters ,, and define the circumference as 
p = sup P. 

21.2 THE EXISTENCE OF r 


We now want to prove the existence of the number m. To do this, we need to 
show that the ratio of the cireumference to the diameter is the same for all circles. 
This is a theorem about sup's, and to prove it, we need a preliminary result on 
sup’s. 

Let P be any bounded set of positive numbers, and let k be a positive number. 
Then kP denotes the set of all numbers of the form kp, where p belongs to P. For 
example, if 

P= [0,1] = (0s z < 1}, 
and k = 3, then 
KP =P = [0, 3]: 
If P = [1, 2] and k = 3, then kP = [%, $]; and so on. 
This “multiplication” is associative. That is, 


J(kP) = (jk)P, 
because 
{jz|x € kP} = {j(kp)|p E P) = (Gh)plp € P}. 
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Thus, for example, we always have 
1 
5 (kP) = P. 


Lemma 1. If bis an upper bound of P, then kb is an upper bound of kP. 
Reason. If p < b, then kp S kb. 


Lemma 2. If c is an upper bound of kP, then c/k is an upper bound of P. 
Proof. Since P = (1/k)(kP), this follows from Lemma 1. 
These lemmas give us the following theorem. 


Theorem 1. sup(kP) = k sup P. 


Proof. Let b = sup P. Then b is an upper bound of P. By Lemma 1, kb is an 
upper bound of kP. 


Suppose that kP has an upper bound 
- c < kb. 


Then c/k is an upper bound of P, by Lemma 2. This is impossible, because c/k < b, 
and b was the least upper bound of P. 


We can now prove the theorem which establishes the existence of m. What is 
needed is the following theorem. 


Theorem 2. Let C and C' be circles with radii r, r’ and circumferences p, p’. Then 


ae; 
2r 26 
"That is, the ratio of the cireumference to the diameter is the same for all circles. 
This common ratio is denoted by 7. 
Proof. Suppose that the circles have the same center. (This involves no loss of 
generality.) In the figure, we indicate the ith side A; 14; of a polygon inscribed 
Ai 


FicunE 21.4 
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in C. To each such polygon there corresponds a polygon incribed in C’, obtained 
by projection outward (or perhaps inward) from the common center D. We then 
have 
ADA;_14; ~ ADA; aA. 
Therefore 
A4, DA, nr 


If the perimeters of our polygons are p, and pp, then we have 


r 
Ph = >" Pn 
Let 
p= supP, and p' = sup P’, 


where P = {pn} and P’ = {pn}, as usual. Then 
Pom. p. 
T 
Therefore, by the preceding theorem, using k — r'/r, we have 
ae t ET. 
mxmpPe-mpPb oc r UU y, 


Dividing both sides by 2, we get the equation called for in the theorem. 
An analogous theorem holds for circular arcs. 


Theorem 3. Let AB and ÁP’ be arcs of the same degree measure, in circles of 
radius r and 7’, respectively. Let the lengths of AB and Á'B' be p and p'. Then 


PIT. 
r p 
p 
P B 
B 
A A 
Ficure 21.5 


This ratio is called the radian measure of the are AB. If AB is a minor arc, 
then p/r is the radian measure of the angle ZBCA. The theorem tells us that the 
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radian measure really depends only on the angle, or on the degree measure of the 
arc, and is independent of the radius of the circle. 
The proof is virtually identical with the proof of the preceding theorem. 


21.3 LIMITS AS THE MESH APPROACHES ZERO . 


Given ] 
p = sup P, 


we know that we can find numbers p, in the set P as close to the sup as we please. 
More precisely, if e is any positive number, there is a p, in P such that 


Tn > p. UE. 


This was Theorem 3, Section 20.5; and it is true because otherwise p — e would 
be an upper bound of P. 

We ought, however, to be able to make a stronger statement than this about 
the numbers p, and their sup. To get p, close to p, we should not have to choose 
the inscribed broken line in any special or clever way. It ought to be true that ps 
is close to p whenever the sides of the inscribed broken line are very short. We 
shall make this idea precise in the next theorem. 

Let | 

= Ao, 301 .-.-, Ag =B 


be the vertices of a broken line B, inscribed in the arc AB. By the mesh of the — 
broken line Ba, we mean the largest of the numbers A;_;A;. Thus the mesh of a | 
broken line B, is the length of its longest segment. In this language, the statement 
that we want to prove can be stated roughly as follows. | 

| 


Pn i$ as close to p as we please, if the mesh AB, is small enough. 


This suggests the idea, but it is not exact enough to form the basis of a proof, 
because it involves highly nonmathematical terms, notably the terms please and 
enough. Statements like this are like bowling balls without finger holes: they are 
easy to look at, but awkward to handle. The corresponding mathematical state- 
ment follows. 


Theorem 1. Let ÁB be an arc of length | 
p = sup P, 


where P is the set of lengths pp of inscribed broken lines B,. For every positive 
number e there is a positive number d such that if the mesh of B, is less than d, 
then p, > p — e. 


Since we always have p, S p, the inequality p, » p — e means that 
lpn — p| < e, that is, p, is within a distance e from p. To see what the whole 
theorem means, let us consider that the author of the theorem has made a promise. 
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He has promised to give helpful advice on how to make p, close to p. People 
come to him and tell him how close they want p, to be to p. They do this by 
naming the number e that they want p, — p to be less than. He then tells them 
how small the mesh must be to give the result that they want; he does this by nam- 
ing the number d that the mesh must be less than. In stating the theorem, the 
author is promising that no matter what number e is named, he will not be at & 
loss: he will always be able to produce a number d that suits the requirements. 

The proof of this theorem is nowhere nearly as difficult as its statement. It 
reads as follows. 

(1) Let BJ be an inscribed broken line, with length pa, such that 


ROB 


(We know that there is such a broken line, because p = sup P.) 

(2) Now let Bm be any inscribed broken line, of length pm. Let Bi’ be the broken 
line obtained by using all of the vertices of Bm and all of the vertices of By. Let 
pt’ be the length of By’. Here the points marked with crosses are vertices of By. 
The points marked with little circles are vertices of Bm. The end points A and B 
are marked both ways because they must be vertices of both. And all the indicated 
points are vertices of By’. 


O : Bm 
. ^ 
xo Bl A 


Figure 21.6 
By repeated applications of the triangular inequality, we have 
p! 2 pw 
de 


Therefore 


This merely says that when you insert new vertices, the length of a broken line 
increases. 

Of course we cannot claim that p, = pt’. It can easily happen that Bm is 
shorter than BY’, because Bm may take “shortcuts” past vertices of By’ (see Fig. 
21.7). Here PR < PQ+ QR. Thus the length of PR is less than the sum of the 
lengths of the corresponding sides PQ, QR, of By’. 

The question is how many times you can gain by these shortcuts, and how much 
you can gain each time. The number of times the shortcuts can appear is surely 
no more than n — 1, because there are no more than n — 1 possibilities for Q. 
The saving at each shortcut is 

PQ + QR — PR. 
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Figure 21.7 


This is surely no bigger than PR, because PR is the longest side of APQR. If k 
is the mesh of Bm, then PR < k. Thus there are at most n — 1 shortcuts, and the 
distance saved at each of them is Sk. Therefore the total saving is 


p! — Pm $ (n — Ik. 
Now we know that 


Therefore 


What we want to get is 


DP — Pm < € 
This will hold if 

(n — Dk < j' 
or 

€ 

k S m —1) 
Our problem is now solved: let 

i733 


If the mesh k of Bm is less than d, then p — Pm is less than e, exactly as we wanted 
it to be. 

We state Theorem 1 briefly by saying that p, approaches p as the mesh of Bn 
approaches zero. This is, of course, the same sort of limiting process that is in- 
volved in the theory of definite integrals. 


21.4 THE ADDITION THEOREM FOR ARC LENGTH 


One of our postulates for angular measure was the addition postulate. This said 
that if C lies in the interior of Z DAB, then mZDAB = mZ DAC + mZCAB. 


Ficure 21.8 
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The corresponding statement about degree measure of circular ares was a theorem. 
PA £m 

The theorem stated that mÁBO = mÁB + mBC. A corresponding theorem 

holds for arc length. 


Ficure 21.9 


Theorem 1. Let ÁB and BC be arcs of the same circle, with only the point B 


— 


in common. Let sı and sz be the lengths of AB and BC ; and let s be the length 


of ABC. Then 
81 + 8 = S. 


The proof is a fairly easy exercise in the use of least upper bounds and limits 


as the mesh approaches 0. 
(1) Suppose that s; + s2 > 8, 50 that sı + s2 — s > 0. Let 


C= 8; Hgm s: 
Let B, be a broken line inscribed in AB, with length pnr, such that 
e 
Pn wd 2' 
Let BJ, be a broken line inscribed in BC, with length Ph, such that 
Pn > 82 — Y 


Fitting these broken lines end to end, we get a broken line 7/ àx of length Pm+n 
such that 
Pm-+n = Pn + Ph: 
Since BY, is inscribed in ABC, it follows that py, € s. On the other hand, by 
addition, we get 
Plin = Pac Pm > 81 + 82 — € 8 
Thus pn $ s and Pirin > 8, which is a contradiction. 
(2) Suppose that sı + 82 < $, $0 thats — sı — 82 > 0. Let 


e= 8 — 8, — 82, 
so that 
s= 8, + 82 € 
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Let d be a positive number such that if B, is inscribed in ABC and has length p, 
and mesh less than d, then 
$4» 8 —€. 


(By Theorem 1, Section 21.3, there is such a d. That is, p, is as close to s as we 
please if only the mesh of B, is small enough.) 

Now let B, be a broken line inscribed in ABC, of mesh less than d, such that B 
is a vertex of Bn. Let pn be the length of Ba. Then B, can be broken up into two 
broken lines BL, BY’ (m + r = n), one inscribed in AB and the other inscribed 
in BC. Ifthe lengths of these broken lines are pm and p;’, then we have 


Pm + Pr = Pr, 


P» $5, 
Therefore 


But sı + s2 = s — e. Therefore 
Pn => e, 


which gives a contradiction. 

If you review the proof of the addition theorem for ratios AB: CD, in Section 
20.5, you will find that this proof is very similar to it. The technique involved 
here is quite important enough to be worth going through twice, or more. 

We remember that when we defined arc length by means of broken lines, we 
allowed broken lines whose sides were not necessarily of the same length. This 
possibility was important in the second part of the proof of the preceding theorem. 


21.5 APPROACHING THE AREA OF A CIRCULAR SECTOR FROM BELOW 


So far in this chapter, the mathematics has been exact and complete. We have 
given a definition of the length of a circular are, and we have proved theorems 
strictly on the basis of the definition. We shall approach the problem of plane 
area, for the corresponding circular sectors, in a more backhanded fashion. First 
we shall calculate the areas of such figures numerically, by a method which will 
no doubt be familiar to the reader in one form or another. We shall then reexamine 
the situation, and see just what we needed to assume to justify our calculation. 
Then, in the following chapter, we shall develop a theory of plane area, sufficiently 
general to apply to the familiar elementary figures, and show that in this theory, 
the area formulas of this chapter become genuine theorems. This backhanded 
approach will involve very little lost motion. And the theory given in the next 
chapter will be much easier to understand if we first survey the situation in a par- 
ticular case, and get a rough notion of how the theory ought to work. 
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Figure 21.10 


By a circular sector we mean à figure like the one above. To be exact, if ÁB is 
an arc of a circle with center at C and radius r, and K is the union of all radii CP, 
where P is in AB, then K is a sector; r is called its radius, and ÁB is its boundary 


arc. 
If we use all of the circle (instead of an are ÁB) then the union of the radii CP 


is the circle plus its interior. Such a figure is called a disk. We shall begin our in- 
vestigation by proving the following theorem. 


Theorem 1. Let K be a circular sector with radius r and boundary arc of length 
s. Then there is a sequence of polygonal regions 
K 1 K 2y***5 
all contained in K, such that 
lim eK, = 3's. 
n» 


Here aK, denotes the area of Kn. : 
Proof. We begin by inscribing in AB a broken line B, in which all the sides are 
congruent, of length bn: 


Figure 21.11 


In the figure, r is the radius of the circle. Obviously all the triangles AA; 1CA; 
are congruent. Therefore they all have the same altitude (measured from C to 
the opposite side A;—14i:)- This common altitude is called the apothem, and is 
denoted by an. Thus the area of each of our triangles is 3@nbn. Let K, be the 
polygonal region which is their union. Then 


z aK, = 4nasbs. 
Let s be the length of AB. 
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We now want to see what happens to bn, dn and aKn, as n — œ. 
(1) The number nb, is the length of our inscribed broken line. Therefore 


Nog S's: 
Therefore 


bn S 


Slo 


Since 
lids Senin’? igi ue 0, 


na N no To 


it follows by the squeeze principle that 
lim b, = 


(2) Since the mesh of the inscribed broken line is bn, and lim, 4 bn = 0, it 
follows that 
lim nb, — s. 


no 


Here we are using the fact that the length of the inscribed broken line approaches 
s as the mesh approaches 0. 
(3) Examining a typical triangle A.A;_C'A;, we see that 


Therefore 
IT 
T — > < dcc ov. 


Since lim, ,4 b, = 0j it follows that 


By the squeeze principle, this means that 
$ lim a, = r. 
no 
(4) Fitting together (2) and (3), we get 
lim (aK,) = lim [$ n+ nba] = drs. 
—90 no 


This proves our theorem. 


21.6 APPROACHING THE AREA OF A CIRCULAR SECTOR FROM ABOVE 


It would now seem natural to show that there is a sequence Li, Lo, ... of poly- 
gonal regions, each of them containing the sector K , Such that 


lim aL, = }rs. 
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The following theorem, however, is easier to prove, and will be sufficient for our 
purposes. 


Theorem 1. Let K be a circular sector with radius r and boundary are of length 
s, and let e be any positive number. Then there is a polygonal region L, con- 
taining K, such that 

aL < irs +e. 


To find such a polygonal region, we draw a cireular sector K’ with the same 
center, and radius r' > r: 


CA-r, CA'-r 
Fieure 21.12 


In the figure, the broken line inscribed in the smaller circle is the same as before; 
and we have also indicated the corresponding broken line, inscribed in the larger 
circle. For the second broken line, the mesh is b} and the apothem is an LetL 
be the polygonal region inscribed in the larger sector. Then 


qeu "rm 
aL = 5 nbn. 


Let s’ be the length of AB . By Theorem 3, Section 21.2, we have 


8.3 
p 
Therefore 
s=". 
Therefore 
nb, s TS. 
Also, by similarity, 
dien d 
as) f^ 
so that 
+2 
auo mp asit alu 


So far, all of this holds true for every r' > r, and for every n. What we need is 
to choose r' so that aL < Àrs + e, and then to choose n so that K C L. on 


\ 
. 
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(1) We want 

2 
TE < ret 2e, or <et 
and this will be true whenever 


r' < vr? + 2er/s. 


We take an r’ > r, satisfying this condition. ; 
(2) Since lim, ,4 a, = r’, it follows that a4 > r for some n. Such an n gives 
us an L that contains K. 


21.7 THE AREA OF A SECTOR 


We can now show that if area theory behaves in any reasonable fashion, then 
the area of a circular sector must be given by the formula 


aK = irs. 


Our conception of reasonableness is conveyed by the following assumptions. 


Assumption 1. There is an area function a, defined for a class M of figures. The 
class M contains, at least, all polygonal regions and all cireular sectors and disks. 


Assumption 2. If K is a polygonal region, then aK is the area of K in the ele- 
mentary sense. 


Assumption 3. (Monotonicity.) If K and L belong to M and K C L, then 
aK S aL. 


Under these three assumptions, we can prove that our formula holds. Let K Ly 
Ko, ... be as in Theorem 5, Section 21.5. Then aK, < aK for every n. Therefore 


drs = lim aK, S aK. (Why?) 
It cannot be true that 3rs < aK. If 80, let 
e = aK — irs, 
and let L be as in Theorem 1, Section 22.6. Then 
aK S aL < hrs +e = aK. 


Thus aK < aK, which is impossible. (You have a good deal of choice, in de- 
ciding how to express the contradiction in proofs like this.) 

Thus there is no reasonable area theory in which the formula ak = 175 fails to 
hold. In the following chapter we shall replace this negative statement by a posi- 
tive one: we shall show that there is an area function a for which our three assump- 
tions are valid. 


te a 
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22.1 THE BASIC DEFINITION 


It is fairly easy to define an area function for which the three assumptions that 
we made in the preceding section are valid. The definition goes like this. First, 
let o be the usual area function for polygonal regions. Now given a set K of points 
of the plane, let Pr be the set of all polygonal regions P that lie in K, and let Nr 
be the set of all numbers aP which are areas of elements of Py. Let 


mK = sup Nr. 


The number m;K is called the inner measure of K. If it happens that K contains 
no polygonal regions at all, then we agree that m K = 0. Thus the inner measure 
of a point or a segment is always = 0. 

Suppose now that K is contained in at least one polygonal region. Let Po be the 
set of all polygonal regions P that contain K. Let No be the set of all numbers 
aP which are areas of elements of Po. Let 


moK = inf No. 


Here inf No is the greatest lower bound of Nọ. The number inf No is called the 
outer measure of K. 
If P € P; and P' € Po, then 
PERCP: 
Therefore 
BGP, and aP S aP'. 


Thus every element of N is less than or equal to every element of No. The two 
sets of numbers must therefore look like the figures below. The figures suggest 
that we must have 


mK £ mK. 
Ni No Ni No 
PAN. Ld a Ge eH 
a ee 
0 miK moK 0 mK=moK age 
Figure 22.1 ; 
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And in fact this always holds. If it were true that 
moK < mjK, 
then there would be a polygonal region P, lying in K, such that 
moK < aP S mjK. 
And aP is a lower bound of No, because 
aP S aP' 


for every P’ in Py. Therefore moK is not the greatest lower bound of No; and 
this is impossible, because moK was defined to be inf No. 
Thus 
mK s moK. 
If the equality holds, then we say that K is measurable in the sense of Jordan, and 
we define the measure of K to be 


mK = mK = mK. 


Since we shall be talking about only one kind of measure theory in this book, we 
shall say for short, that K is measurable if m;K = moK. We now have the follow- 
ing theorems. 

Theorem 1. Every polygonal region P is measurable; and mP = aP. 


Proof. P belongs to Pr, because P C P. And if P' c P, then oP' € aP. There- 
fore m;P = aP. 


Similarily, P belongs to Po, because P > P. And if P' 5 P, then aP’ = aP. 
Therefore mo9P = aP. Therefore 


mP = mP = m,P = aP, 
which was to be proved. 


Theorem 2. Outer measure is monotonic. That is, if A C B, then mpA € moB. 


Theorem 3. Measure is monotonic. That is, if A and B are measurable, and 
A C B, then nA € mB. 


The proofs are left as exercises. 


Theorem 4. Every circular sector K is measurable. If the radius is r, and the 
boundary arc has length s, then 


mK = irs. 
Proof. The results of the preceding section tell us that 


mK = is, and = moK < hs. 
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Since myK € moK, it follows that 
mK = mK = 37s, 
which was to be proved. 


If you make a mental review of our investigation of circular ares in the preced- 
ing chapter, you will see that the definition of Jordan measure is modeled on it. 
One way of putting it is to say that in the preceding chapter we gave & proof, and 
that in the present section our task has been to find the theorem that our proof 


proves. 
By a discussion exactly analogous to the discussion in Section 21.7, we can show 


that if K is a disk of radius r and hence of circumference s = 27r, then 
mK = 4rs = moK. 
Thus we have the following theorem. 


Theorem 5. Every disk K is measurable, and 
mK = tr’, 
where r is the radius. 


It seems that we have worked rather hard to turn the familiar formula A = 7r 
into a theorem. But our work has been necessary. The easiest way to see this is 


to try to think of a simpler approach. 


Prosiem Ser 22.1 


. Show that every point P forms a measurable set, and that mP -— 0. 
. Show that every segment AB is measurable, and that mAB - 0. 


nore 


3. Prove Theorem 2. 
4. Prove Theorem 3. 
5. Consider a plane with coordinate system. Let K be the unit square: 


FIGURE 22.2 


Let L be the set of all rational points of K. That is, (z, y) belongs to L if z and y are 
both rational, 0 € z $ 1 and0 S y S 1. Is La measurable set? Why or why not? 
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Figure 22.3 


6. In a coordinate plane, consider a right triangle in the figure above. We divide the 
interval (0, a] into n congruent segments, each of length a/n, and construct an inscribed 
polygonal region P, and a circumscribed polygonal region Ph. (In the figure, the boundary 
of P, is drawn solid, and that of P% is dashed.) Calculate aP,andaP’,. Verif y algebrai- 
cally that 


sup(aP,) = inf(aP;) = jab. 
This problem throws some light on the theory. It means that if we know only about 


areas of rectangles, we can caleulate the Jordan measures of right triangles in the in- 
dicated position. 


22.2 THE CLASS OF MEASURABLE SETS 


Let M be the class of all measurable sets in the plane. (Here, as usual, we use 
the word class as a synonym for the word set.) We shall show that this class has 
the following simple properties. 


Theorem 1. If M, and M; belong to M, then so also does M, U Mo. 
It will follow by induction, that: 


Theorem 2. Finite Additivity. If each of the sets M 1, M5, ..., Mn belongs to 
M, then so also does their union 


Theorem 3. If M, and M; belong to M, then so also does M; — M: 
Theorem 4. If M, and M; belong to M, then so also does M, N Mo. 
Theorem 5. If M, and M; belong to M, and 

MinM;-9O, 


then 
m(M, U M3) = mM, + mMs. 


| 
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Theorem 6. If M, and Mz belong to M, and M, C M5, then 
m(Ms — M) = mM» — mM ;. 
Theorem 7. If M, and M3 belong to M, and 


m(M; n M3) — 0, 
then 
m(M, U M5) = mM, + mMs. 


The proofs begin with the following lemma. 


Lemma 1. Let M be a measurable set in the plane. Then for every positive 
number e there are polygonal regions P and P' such that 


PCMc?P' (1) 
and 
aP' — aP < e. (2) 
Proof. Take P so that P C M and 
aP > mM — 5° 
Take P’ so that M C P' and F 
aP' < moM + 2' 


'Then E 
—aP < —mjM + 3j 


and by addition, we get 
aP' — aP < e. 
(We are using, of course, the fact that mM — mM = 0.) 


The converse is also true. 


Lemma 2. Let M be a set of points in the plane. Suppose that for every positive 
number e there are polygonal regions P, P’ such that 


PCMCP' (1) 


and 
oP’ = aP € e. . Q 


'Then M is measurable. 
Proof. Given such regions P, P', we have 
moM S aP', aP S mM. 


Therefore 
—m,M s —aP, 


and so 
moM — mM S aP' — aP < e. 
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Since mM — mM < e for every positive number e, and m9M — mM = 0, it 
follows that mo9M — m;M must be = 0. Therefore M is measurable, which was 
to be proved. 


We shall now prove Theorem 1. Consider two measurable sets M p Ms. Let 
€ be any positive number: 


Pi 


Xs 


/] 


Figure 22.4 


Take Pı, P1 for Mj, as in Lemma 1, so that 


P,CMiCP| 
and 

aPi — aP, < 5- 
Take P5, P3 for M, so that 

P CM: CP 
and 

aP, — aP, < $. 
Let 

P = Pi UP,, 
and let 

P' = Pi UP. 
Then t ? 


aP' — aP S (aP — aP) + («P5 — aP;). 


To see how this works, consider a simpler figure: 


FiavnE 22.5 


THE CLASS OF MEASURABLE SETS 283 


FIGURE 22.6 


Here aP’ — aP is the area of the thin strip in the figure above. This is € the 
sum of the differences 
aP, — oP, aP% — aP3. 
'The same principle applies in the general case. If we form a suitable triangula- 
tion of P! U P4, then each of the three differences 


aP' — aP, aP, — oP, aP5 — oP2 
is the sum of the areas of a collection of triangular regions, and every triangle 
that contributes to «P — aP’ must contribute at least once, and perhaps twice, 


to (aP1 — aP1) + (aP$ — aP3). 
Therefore 


and so M, U Ma is measurable, which was to be proved. 

Theorem 2 follows, as we pointed out, from Theorem 1. 

To prove Theorem 3, we use the same figures. To prove that M, — Mais 
measurable, we form à triangulation of Pi U Pj in which each of the regions P1, 
P1, P5, P5 is a union of triangular regions, intersecting only in edges and vertices. 
Let P’, this time, be the union of all the triangular regions that lie in P^, but not 
in Py. Let P be the union of all of the triangular regions that lie in Pı, but not 


in P}. Then P and P' look like this: 


p 
; r | 
P Li E 
-i 


FIGURE 22.7 


Here the boundaries of P and P" are drawn solid, and the rest of the figure is dashed 
merely to remind us of how P and P' were defined. We recall that e was any 


positive number, and aP{ — aP, < e/2; aP4 — aP2 < e/2. 
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Evidently, 


aP’ — aP S (aP1 — aP1) + (aP — aP;). 
Therefore 


aP' — oP <5 +5 = 6; 


and so M, — M; is measurable, which was to be proved. 
To prove that M; N M; is measurable, we merely need to manipulate sets, 
without using either geometry or algebra. A figure will make it easier to keep 


track of what we are doing. 
M; 
M, 


1 
Figure 22.8 


Given M, E M, M, E M. By Theorem 1, 


Mi; U M5 EM. 
By Theorem 3, 


M: — Miem, Mı — MEM. 
Therefore, by Theorem 1, 
(Mı — Mz) U (M; — M) e M. 
Hence, by Theorem 3. 
(Mı U Mz) — [(Mi — M3) U (M, — Mı) e 3n. 


This proves Theorem 4, because the set we have described is precisely M; N Mo. 
Now for Theorem 5, which says that if M; and M; do not intersect, then 


m(M, U M3) = mM, + mM. 


First we observe that if P is a polygonal region lying in M; U M 2, then P — 
Pı U Pz, where P, C M; and P; C My: 


M, 
'  Fiaure 229 "^ 
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Therefore 


aP = aP; + aP2 s miM; + miM». 
Therefore 
m(M, U M2) S mM + m1Mo, 


because mz (M, U M3) is the least of the upper bounds of the set of numbers aP. 
On the other hand, given any € > 0, we can find P, C M; and P; C Mə so 
that A 
aP; > mM- 5) aP > miM» — 5° 


We then have 
aP > mM, + miMs — € 
Therefore 
mı(Mı U M») 2 mM, + MM2 — € 


for every e > 0. Hence 

mı(Mı U M2) 2 mM, + miMa. 
Since we already have the reverse inequality, it follows that the equality holds. 
But all three of our sets are measurable. Therefore 


m(M, U M3) = mM, + mM, 


which was to be proved. , 
This is the last algebraic proof in this section; we get the rest of our theorems 


from the earlier ones. 
Proof of Theorem 6. Given M, C M», we have 
M = (M2 — M)U Mi, 


and the two sets on the right do not intersect. Therefore 


mM, = m(Ma — My) + mM, 


so that, 
m(M2 ze M) = mM 5 Y mM, 


as desired. 


Proof of Theorem 7. Given m(M, N M3) = 
m(M, U M3) = mM, + mM». 


0, we want to show that 


First we observe (Fig. 22.10) that 
M, — M2 = Mi — (Mı M3) 


and 
Mz — Mı = M3 — (Mi n M3). 
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ac 


Mi 


Figure 22.10 
Therefore 


m(Mı — M2) = mMı — 0, and  m(M; — Mj) = mM; — 0. 
Now 
Mı U Mz = (M; — Mj) U (Mi n M3) U (M5 — My); 
therefore 
m(M, U M2) = mM, + mM, + 0, 


which was to be proved. 


PROBLEM Ser 22.2 
All of the following problems are to be solved on the basis of the theorems proved in 
this chapter, plus, of course, our old theorems on areas of polygonal regions. 


1. Let C be a circle of radius r, let L be the interior of C, and let K be the disk C U L. 
Whatism;L? What is mol? Why, in each case? 


2. Show that every circle is measurable, and that its measure is = 0. 
3. Show the same, for arcs or circles, 
4. Show that if moM = 0, then every subset of M is measurable, and has measure = 0. 


5. Show that the interior of a triangle is always measurable. What is the measure of 
such a set? 


6. A segment of a circle is a figure like this: 


B 
p] 
PUE NE 

E 4. 
penn 
\ runt 
NM 4 

~ p d 


Figure 22.11 


Prove that a segment of a circle is always measurable, and find its measure. [Warning: 
If K is the sector with boundary are ÁB, and L is the segment, then it is not true that 


L = K — AABC. The first step in solving the problem is to get a correct expression 
for L.] 
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Ficure 22.12 


7. Show that the region indicated in the figure above is measurable, and find its measure. 


22.3 AREAS UNDER THE GRAPHS OF CONTINUOUS FUNCTIONS 


Jordan measure is the theory that is needed to fill a certain gap in elementary 
calculus. Suppose that we have a region R, bounded by the graphs of two con- 
tinuous functions. 


Y 
y=a(x) 


| 
l 
l 
; 
b 
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Here 


f(z) S g(x),  asssb 
and 
= {(@, yla Sz Sb and flr) Sy < g(x)}. 


We compute the area of R by the formula 
b 
mR = | (g(x) — f(0 dz. 


The-usual derivations of this area formula are not proofs in any strict sense because 
they do not appeal to any valid definition of area. "These derivations are extremely 
persuasive. And now that we know about Jordan measure, it is easy to see that 
they really show that the definite integral gives the Jordan measure of the region. 
The situation here is much the same as for circles. The familiar elementary dis- 
cussion becomes quite adequate, as soon as we supply the definition to which it 
tacitly appeals. 
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Figure 22.14 


To see this, consider first the case where R is the region under a single positive 
continuous function f(z) as shown in the figure above. Here f 


{20 aszxsb, 
and 


R= {(z,y)la Sz Sb and 0<y s f(x}. 

As in the definition of the definite integral, we take an ascending sequence of . 
points, a 
@— 1 XX; << Ly b, 


on the interval [a,b]. Let Ax; be the length of the ith subinterval. Let m; be ; 
the minimum value of f(x) on the ith subinterval, and let M; be the maximum - 
value of f(z) on the ith subinterval. (In the figure, we have indicated m; and Mi) — 


Then 
n 
X miii 
ici y 
is the sum of the areas of the inscribed rectangles (with dashed upper bases in the — 
figure) ; and 
n 
5 M jAz 
ici 


is the sum of the areas of the circumscribed rectangles (drawn solid in the figure). — 


There are various ways of setting up the definite integral; we shall not review the 
theory here. But in any case it turns out that 


b 
YmiAzr; S ji f(z) dz S ZM;Arz;. 
And, for continuous functions f(x), we can make the difference, 
XM Az; — YimiAz;, 


as small as we please, merely by taking all of the numbers Az; sufficiently small, 3 
Thus, for any e > 0 there is a sequence, z 


a= Zo <%1 Leie <a, = b, 
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for which 
EM; Az; — Lm;Ar; < e. 


Here the first sum is the area aP of a polygonal region lying in R, and the second 
is the area of a polygonal region P’ containing R. By Lemma 2 of the preceding 
section, this means that R is measurable: m;R = moR. Since the integral is an 
upper bound of the numbers aP, we have 


b 
m;R S Í f(x) dz: 
a 
Similarly, the integral is a lower bound of the numbers aP’, and so 
b 
Í f(x) dz S moR. 
Since m;R = mok = mR, it follows that 


a [10 ds, 


which was to be proved. 

The extension to the case where R is bounded by the graphs of two continuous 
functions is not very hard. We shall not go into it here. 

Many calculus books, including some otherwise excellent ones, undertake to 
define the area of such a region as the definite integral. This will not work. The 
area of a region ought to depend merely on its size and shape, and not on the way 
it is placed relative to the axes. That is, the area of a region ought to be unchanged 
under rigid motions. This is not very clear if we use the definite integral to define 
the area. The trouble is that the same region may be described in infinitely many 
ways as the region between the graphs of two continuous functions. Here Ri 


Y 


y=8,(x) 


Fieure 22.15 
and Rz are isometric. In this chapter of this book we can infer that 
b d 
[Leo — nns = f, tos) = fat de: 
a 


The reason for this inference is that both of the integrals give the right answer to 
the same problem. But if your definition of measure is stated in terms of a co- 
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ordinate system, you are left with the problem of proving by calculus that the in- 
tegrals have the same value; and this is not a very practical enterprise. 

If you review the theorems given in this chapter so far, you will see that Jordan 
measure theory has, at this point, been brought down to earth. Given a figure 
whose measure you would normally expect to calculate, the chances are that you 
can prove that the figure is measurable in the sense of Jordan, by using the theo- 
rems of this section and the preceding one. (In fact, a certain amount of effort is 
required to think of a figure which is not measurable in the sense of Jordan.) 
Moreover, the theory at this point is adequate to justify the elementary methods : 
of calculating plane areas. The theory of measure has been generalized, by Henri 
Lebesgue, in such a way as to assign areas to an even larger class of figures. But 
Jordan measure is quite adequate for the purposes of elementary mathematics. 


PROBLEM Ser 22.3 
1. Given that f(z) is continuous and = 0 fora S z S b. Let 
R = {r,a Sr Sb and 0 < y< f(z)}. 


(Here y < f(z) is not a misprint; the graph itself is not in R’.) Show that R’ is measur- 
able, and that mR’ = mR. 
2. Show that if F is the graph of a continuous function, fora € z < b, then mF = 0. 


3. Show that if R is a region of the sort described at the beginning of this section, then 
mR is the integral of the difference of the two functions, 


tat AA 
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23.1 BASIC ASSUMPTIONS FOR THE THEORY OF VOLUME 


The theory of volume, carried out in a spirit like that of the preceding chapter, 
is technically very difficult. Moreover, the work required is not really worthwhile, 
because anyone pursuing the theory at such length should study not the theory 
of Jordan but that of Lebesgue, which has superseded it for the purposes of ad- 
vanced mathematics. For this reason, we shall treat the theory of volume only 
in a style analogous to that of Chapter 21, basing our derivations on postulates, 
and not attempting to describe a volume function which satisfies our postulates. 

Suppose, then, that we have given a class 'U of sets of points in space, called 
measurable sets. Here U stands for volume. We suppose also that we have a function 


vS0—R 


of V into the nonnegative real numbers. If M € V, then vM will be called the 


volume of M. 
Our first two postulates are designed to ensure that the elementary solids whose 


volumes we propose to discuss really do have volumes. 
V-1. Every convex set is in U. 
V-2. If M and N belong to V, then MUN, MNN, and M — N also belong 
to V. 


The rest of our assumptions deal with the volume function v. 
V-3. vis monotonic. That is, if M,N €, and M CN, then vM $ vN. 
V-4. If M,N € 0, and «M nN) = 0, then 

v(M UN) = vM + oN. 


Obviously, if v is a volume function satisfying these conditions, then we can get 
more such functions by multiplying v by any positive constant. To “determine 


the unit of volume,” we state the following postulate. 
291 
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V-5. The volume of a parallelepiped is the product of 
its altitude and the area of its base. 


Here we are assuming that you know what a parallele- 
piped is. If not, see Section 23.3 below. The figure and 
the formula remind us that any face of a parallelepiped 
can be regarded as the base. 

We recall that triangular. regions with the same base 
and altitude always have the same area. Our final assump- 
tion for volume is analogous to this: 


vK=a-b-c 


FIGURE 23.1 


V-6. Cavalier?'s Principle. Let K and K' be figures in space. Let Eo be a plane. 
If E || Eo, then K N E and K' n E will be called corresponding horizontal cross 
sections of K and K’, respectively. Suppose that 


(2 Kev; 

(2) K'e v; 

(3) every horizontal cross section of K is measurable in its own plane; 

(4) every horizontal cross section of K is measurable in its own plane, and 

(5) corresponding horizontal cross sections D, D' of K and K' have the same 
measure in their plane. Then 


vK = vK'. 


mD-mbD', vK=yK’ 
FIGURE 23.2 


This principle is the key to the study of volume; it deserves a very careful 
reading. Obviously hypothesis (5) is crucial; without it, Cavalieri's principle 
couldn't possibly be valid. But the other four hypotheses are necessary too. In 
fact, if K € V, it does not follow that all horizontal cross sections of K are measur- 
able in their own planes. Also, if all horizontal cross sections of K are measurable 
in their own planes, it does not follow that K is measurable, and similarly, of course, 
for K'. 

Before we can proceed to calculate volumes, we need to do a little preliminary 
geometry. 


— 


dune mr 


u—€—— 
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23.2 CROSS SECTIONS OF CONES AND PYRAMIDS 


Let D be a (circular) disk in a plane E, and let V be a point not in E. The cone 
with base D and vertex V is the union of all segments VP, where P belongs to D. 
The altitude of the cone is the perpendicular distance from V to E. 


Ficure 23.3 


Now let E; be a plane parallel to E, on the same side of E as V, lying at a per- 
pendicular distance k < h from E. Let C be the center of D, and let r be the radius. 
For each point B of D, let B’ be the point in which VB intersects Ey. Let Dy be 
the set of all such points B’. We call D; the cross section of the cone at altitude k. 


Figure 23.4 


Theorem 1. D, is a disk of radius 


Proof. Let A, A’, C and C’ be as in the figure above. Let B be any point of D, 
and let B’ be the corresponding point of D'. Since E, || E, any plane that inter- 
sects Ej and E intersects them in a pair of parallel lines. Therefore 


ÄB | B and BC" || BC. 
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Hence AVA'B' ~ AVAB and AVB'C' ~ AVBC. 
Thus Va Ah—k, AB CB 

VA h AB CB 
Then 


If CB < r, it follows that 
h—k 
hay? 


and conversely. Therefore D; is a disk of radius 
h—k, 
h , 


C'B' s 


r= 
which was to be proved. 
Using the formula mD = mr?, we get the following theorem. 


Theorem 2. Let K be a cone with base area a and altitude h. Let D, be the cross 
section of K at altitude k. Then 


EA 
mb, = (5) a. 


Proof. We know that 
a — Tr and = mDy, = mr, 


where r is the radius of the base, and r' is the radius of Dy. Therefore 
2 INS INS 
mD: = z(t 43) p= (4-4) a, 
which was to be proved. 


If we use a triangular region as base, instead of a disk, we have a very similar 
discussion leading to a very similar theorem. 


y 


> - 


| 


Figure 23.5 
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Let T be a triangular region lying in a plane E, and let V be a point not in E. 
The pyramid with base T and vertex V is the union of all segments VP, where P 
belongs to T. The altitude of the pyramid is the perpendicular distance h from 
V to E. 

Let E, be the plane parallel to E, on the side of E that contains V, such that the 
perpendicular distance between E and E, isk < h: 


FicunE 23.6 


Let the boundary of T be A ABC, and let A’, B, and C” be the points where E; 
intersects VA, VB, and VC. Let T+ be the triangular region corresponding to 
A A'B'C'.. We then have 


AVAD ~ AVA'D', AVBC ~ AVP'C', 
AVAB ~ AVA'B', AVAC ~ AVA’. 


The reason is that if a plane intersects each of two parallel planes, it intersects 
them in two parallel lines. Therefore 


"DA| DA, AB’ | 4B, 
BC’ || BC, AC’ || AC. 
From these parallelisms, our similarities follow. Therefore 


VAS SVD eh =k B'O th ke 


—2 4 


KA iV Die op oh CIRO) PRAE 
AC VA'_h—k A'B' h—k. 
Mac VA xor AB, cath 


Hence 
AA'B'C' ~ AABC, 
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by the SSS similarity theorem (Theorem 3, Section 12.2). And the areas of the 
corresponding regions T, T; are related by the formula 


fang 
mT, = (o mT. 


(This follows from Theorem 8, Section 13.2.) Thus we have the following 
theorem. 


Theorem 3. Let P be a pyramid with base area a and altitude h. Let T+ be its 
cross section at altitude k. Then 


IEN 
n7, = (25-4) a. 


23.3 PRISMS AND CYLINDERS 


Given a pair of parallel planes Z,, E», a set B, lying in Æ, and a line L which 
intersects E; in exactly one point: 


Figure 23.7 


Through each point P of B, we take a segment PP’, parallel to L, with P’ in E». 
The union K of all such segments PP’ is called a cylinder; B, is called its lower 
base, and L is called its directrix. The intersection Ba of the cylinder with E» 
is called its upper base. (Thus Bg is the set of all points P’.) The perpendicular 
distance between E, and E; is called the altitude of the cylinder. And the seg- 
ments PP’ are called elements of the cylinder. 

If B, is a disk, then K is called a circular cylinder. If B, is a polygonal region, 
then K is called a prism. If L is perpendicular to E,, then K is called a right 
cylinder. If a cylinder has a parallelogram region as its base, then it is called a 
parallelepiped. If P and Q are points of B,, and P' and Q' are the corresponding 
points of B5, then it is not hard to see that []PP’Q’Q is a parallelogram. 
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Proof. The lines PP’, 0Q’ are parallel, because they are both parallel to L 
(unless one of them is =L, in which case the same conclusion follows). Therefore 
PP’ and QQ’ are automatically coplanar, lying in a plane E; E intersects each of 
the planes E,, E», and E therefore intersects them in a pair of parallel lines. There- 
fore PQ || PQ. Therefore [-]PP'Q'Q is a parallelogram. Thus we have the follow- 


ing theorem. 


Theorem 1. In any cylinder, the lower base and the upper base are isometric. 


Proof. In defining the cylinder, we used a correspondence P <> P' between the 
upper base and the lower base. Since opposite sides of a parallelogram are con- 
gruent, we always have 

PQ = P'Q'. 


Therefore our correspondence is an isometry. 

In particular, if the cylinder is circular, the two bases are always disks with the 
same radius. If the cylinder is a prism with a triangular base, then the two bases 
are always congruent. If the cylinder is a prism, with any polygonal region what- 
ever as base, then the two bases, being isometric, always have the same area. All 
of this is conveyed by Theorem 1; the basic idea here has nothing to do with the 
shape of the base. > - 

By a horizontal cross section of a cylinder, we mean the intersection of the cylinder 
with a plane parallel to the base. 


FIGURE 23.8 


Theorem 2. All horizontal cross sections of a cylinder are isometric. 


This follows immediately from Theorem 1. To see this, we merely need to observe 
that every horizontal cross section B is the upper base of a cylinder with base B,. 

From Theorem 2 we get the same sort of special conclusions that we got from 
Theorem 1, i.e., if the base is a disk then all the horizontal cross sections are disks 
with the same radius; if the base is triangular, then all of the cross sections are 
congruent; and if the base is a polygonal region, then all of the horizontal cross 
sections are polygonal regions with the same area. 
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23.4 VOLUMES OF PRISMS AND CYLINDERS 
To make use of the additivity postulate V-4, we need the following theorem. 


Theorem 1. If M € U, and M is a bounded set lying in a plane, then vM = 


Proof. M, being bounded, lies in a rectangular region R in its own plane. Let 
aR — ab. Then for every positive number h, M lies in a parallelepiped K, with 


vK — abh. 


Obviously vK is as small as we please if h is small enough. More precisely, given 
any e > 0, we have 


vK = abh « e, 
whenever 2 
À« zi 
Therefore 
| vM iK Se 


for every e > 0. Therefore »M = 0. 


Theorem 2. Let K be a right prism, of altitude h, with a right triangular region 
T as base. 


Figure 23.9 


Then vK = h- aT. 


Proof. Construct another right prism K’ with base 7”, in such a way that T U T 
is a rectangular region, and K U K' is a parallelepiped. If a and b are as in Fig. 
23.9, then 


v(K U K’) = abh. 
By Cavalieri’s principle 
vK = 0K’. 

By Theorem 1, 

v(K n K’) = 
Therefore 

v(K U K’) = vK + vK' = WK. 

Hence 


vK = iw(K u K^) = habh = 4h aT, 
which was to be proved. 
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More generally, we have the following theorem. 


Theorem 3. Let K be a right prism, with lower base B and altitude A. 


Then 
vK = h- aB. 


Fieure 23.10 


Proof. B is the union of a finite collection of triangular regions T;. Thus K is 
the union of a finite collection of prisms K; with the Ts.as bases. We know by 


Theorem 2 that 
vK; =h- aT; 


for each i. And 
vK = D vK ;, 
i=l 


because each K; intersects the others in a set of volume = 0. Therefore 


wa Y hai 


i-i 
=h- ali 
tol 

, = h- aB. 

Theorem 4. Let K be any prism (right or not). Then vK is the product of the 

altitude and the base area. 

Proof. Let K’ be a right prism, with the same base area as K and the same 
altitude. 
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By Theorem 3, 
vK’ = h- aB' = h- aB. 
By Cavalieri’s principle 
vK' = vK. 
Therefore 
vK = h- aB, 
which was to be proved. 
For cylinders with circular bases, the volume formula and its derivation are the 
same. 


Theorem 5. Let K be a cylinder whose altitude is h and whose base is a disk of 
radius r. Then 


vK = mr?h. 
To prove this, we take a right prism L, with base area 77? and altitude h. Then 
vL-= mr?h. 


By Theorem 2, Section 23.3, all horizontal cross sections of K and L have the 
same area. Therefore, by Cavalieri’s principle, we have 


vK = vL — trh, 
which was to be proved. 
23.5 VOLUMES OF PYRAMIDS AND CONES 


Theorem 1. Given two pyramids with their bases in the same plane and their 
vertices on the same side of this plane. If they have the same base area and the 
same altitude, then they have the same volume. 


>= 
Òi iamnan; 


Figure 23.12 


Proof. Let the common base area be a, and let the common altitude be h. By 
Theorem 3, Section 23.2, the horizontal cross sections at height k have the same 


area, namely 
ak = h a. 


By Cavalieri's principle, the two pyramids have the same volume. 


— CQ 
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FIGURE 23.13 


We can now compute the volume of a pyramid, by the following ingenious 
device. Consider first a right prism K with a triangular base in the figure above. 
The figure below indicates how the prism can be expressed as the union of three 


triangular pyramids: 
B A B A 
SA. — 
F F 
K, K; 


3 


K, 2 
Figure, 23.14 


Now ABCF œ ABEF. If we regard K, and K as pyramids with A as “top 
vertex,” then Kı and Ky have the same base area, and they have the same altitude, 
because the altitude of each of them is the perpendicular distance from A to the 
plane that contains B, C, E, and F. By the preceding theorem, we have 


vK, = 0K. 


Now let us regard Kz and K3 as pyramids with F as “top vertex." Then Ke 
and K; have the same base area, because A ABE & AADE, and they have the 
same altitude, because the altitude of each of them is the perpendicular distance 
from F to the plane that contains A, B, D, and E. By the preceding theorem, we 


have 
vK5 = vK3. 


Since Kı, Ks, and Ka intersect each other only in planar sets, which have 


volume = 0, we have 
vK = vK, + vK3 + vK3 


= 30K 2. 
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Fieure 23.15 


Suppose now that we start with a pyramid L with a triangular base. Let K3 
be a pyramid with the same base and altitude, with a vertical edge AD, as in 
Fig. 23.14. And let K be a right prism with the same base and the same altitude 
as Ks. Then 


vK = wK; 
= SL. 
Therefore 
vL = iK 
E jah, 


where a and h are the base area and altitude of L. Thus we have the following 
theorem. 


Theorem 2. The volume of a pyramid is one-third the product of its base area 
and its altitude. 


| 
| 
n 

hi 
| 
i 
I 
b 


vK= jah 


Figure 23.16 


If we use the full force of Cavalieri’s principle, then the corresponding formula 
for the volume of a cone becomes almost trivial. Given a cone with base area 
a = mr’ and altitude h (Fig. 23.17). Take any pyramid with its base in the same 
plane, with the same base area a and the same altitude h. For the two figures il- 
lustrated in Fig. 23.17, cross sections at the same height have the same area; by 
the theorems of Section 23.2, the cross sections at height k have area 


ag 
ay = (^ i t) a. 
By Cavalieri's principle, the cone and the pyramid have the same volume. Since 
we know that the volume of the pyramid is 4ah, we have the following theorem. 
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FIGURE 23.17 


Theorem 3. The volume of a cone is one-third the product of its base area and 
its altitude. 


Your only “problem” for this section is the following project. Close the book; 
draw a figure showing the decomposition of the right prism K into the three pyra- 
mids Kı, Kz and Ks; and then draw figures showing the three pyramids sepa- 
rately. If you can do this, the chances are that you understand this whole section, 


and conversely. 


23.6 THE VOLUME OF A SPHERE 


To find the volume of a sphere, we first cireumscribe a cylinder around it: 


Ficure 23.18 


Let K be the sphere plus its interior; let C be the circumscribed cylinder, and let 


beg 


hat 
pex C=KUL. 


We know how to find vC: 
ova vC = mr? -2r 

= 27r. 
Therefore, if we can find vL, we can calculate »K by subtraction as 


~ vK = rC — vL. 
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In calculating vL, we shall use Cavalieri’s principle in the same way as in the last 
two sections. That is, we shall find a figure whose volume we know, which has 
the same cross-sectional areas as L. 

The areas of the cross sections of L are easy to calculate. Figure 23.18 shows 
a vertical cross section of our sphere and its circumscribed cylinder. Thus the 
cylinder appears as a rectangle and the sphere appears as a circle. The cross sec 
tion of L at height k is a figure like this: 


Figure 23.19 
The outer radius is r, and the inner radius is 
a E F, 
by the Pythagorean theorem. Therefore the cross-sectional area of L at height kis 


aj = mr? — ms? 


= m(r?— s?) 
= mir? — (r? — xk 
= vk? 


Now consider the figure L’ shown below. On the right we show a vertical cross 


Ficure 23.20 


section. The cross section of L at height k is a disk of radius k. Therefore 
cross-sectional area at height k is 


a, = mnk?. 


Thus 
vL = vl. 
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But vL! = 2-4ar?-r 
= ĝrr?. 
Therefore 
vL = ¥rr’*. 
Hence 
vK = vC — vL 
= 2:1! — ar? 
= $nr°. 


For reference, we write this as a theorem. 
Theorem 1. The volume of a sphere of radius r is $rr?. 


PROBLEM Ser 23.6 


1. If we pass a secant plane through a sphere, it cuts the sphere into two parts, one of 
which is lens-shaped like this: 


FIGURE 23.21 


Such a figure is called a spherical segment. Find the volume in terms of r and s. 
2. By a spherical sector we mean & figure like this: 


/r 


FIGURE 23.22 


On the right we show & vertical cross section, which forms a sector of a disk. Find the 
volume in terms of r and s. 


Bum 24 
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24.1 ABSOLUTE GEOMETRY, CONTINUED: THE CRITICAL FUNCTION 


In this chapter, we shall make heavy use of the incidence and separation theo- 
rems of Chapter 4. For convenience, we briefly restate two of them: 


The Postulate of Pasch. Given AABC and a line L (in the same plane). If L 
intersects AB at a point between A and B, then L also intersects either AC or 
BC. 


C 
A 
L? L? Z 
Figure 24.1 


(This was Theorem 14, Section 4.1.) 


The Crossbar Theorem. If D is in the interior of ZBAC, then AD intersects 
BC. 


FIGURE 24.2 


(This was Theorem 4, Section 4.3.) 


Given a line L and an external point P. Let A be the foot of the perpendicular 
from P to L, and let B be any other point of of L (Fig. 24.3). For each number r 
between 0 and 180 there is exactly one ray PD, with D on the same side of AB P as 
B, such that 

mZAPD =r. 
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P 
E D 
n r 
I 
Spt kG tcv ibt 
C A B h 
Ficure 24.3 


Obviously, for some numbers Tj PD will intersect AB. (For example, take r = 
mZ APB.) Forr Z 90, PD will not intersect AB. Let 


K= (r|PD intersects AB}. 
Then K is nonempty, and has an upper bound. Therefore K has a supremum. Let 
ro = sup K. 
The number rọ is called the critical number for P and AB. The angle Z APD with 
measure — ry is called the angle of parallelism of AB and P. 
— t 
Theorem 1. If mZ APD = ro, then PD does not intersect AB. 


— — 
Proof. Suppose that PD intersects AB at Q: 


FIGURE 24.4 


It R is any point such that A-Q-R, then mZ APR > To, $0 that ro is not an upper 
bound of K. 
—. QE 
Theorem 2. If mZ APD < ro, then PD intersects AB. 


Ficure 24.5 
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Proof. Since ro = sup K, and mZ APD < ro, it follows that mZ APD is not 
an upper bound of K. Therefore some rit in K is >mZ APD. let D' be such that 


mZ. APD' = r. Then PD intersects AB i ina point F. F. But PDi is in the interior 
of 4 ZAPD’'. Therefore, by the crossbar theorem, PD intersects AF. Therefore 
PD intersects AB. Thus th there is a certain “critical ray " PD, with mZAPD = = tay 
PD does not intersect AB, but if F is in the interior of ZAPD, = PF does 
intersect AB. 


(Hereafter, if F is in the interior of Z APD, we shall say for short that AF is an 
interior ray of ZAPD.] 


P 
I D 
I 
i F 
! F 
A 
FicunE 24.6 


We note that ro was defined in terms of P, A, and B. It turns out, however, that 
ro depends only on the distance AP. 


Theorem 3. Let P, A, B and also P’, A’, B' be as in the definition of the critical 
number. If AP = A’P’, then the critical numbers ro, rj are the same. 


P’ 


Ficure 24.7 


Proof. Let 
K= (PD intersects AB}, 


and let 
—> —* 
= (r|P' D' intersects A'B'), 


as before. If If r € K, let Q be the point where PD intersects AB, and let Q' be the 


point of AB for which A’Q’ = AQ. Then mZ A'P'Q' = r. (Why?) Therefore 
r € K'. Thus K C K'; and similarly K’ C K. Therefore 


K'— K and sup K’ = supK. 


We now have a function AP — ro. We shall denote this function by c, and call 
it the critical function. Thus, for every a > 0, c(a) denotes the critical number 
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Fieure 24.8 


corresponding to AP = a. Thus PD intersects ‘AB when mZAPD < c(a), 
but PD does not intersect AB when mZ APD 2 c(a). 


We shall now investigate the function c. 


Theorem 4. c never increases as a increases. That is, if a’ > a, then c(a’) S c(a). 
P 
i D 
Pi (a) 
: ay? D 
| 
s sud d dicus n lm 
A B 
~Fieure 24.9 


Proof, Given P, P’, with a = AP, a! = AP’, asin Fig. 249. Take PD so that 
mZAPD = c(a), and take P'D' so that mZ AP'D' = c(a). Then PD and PD’ 
are parallel. Therefore all points of P’D’ are on the side of PD that contains P'. 
And all points s of AB are on the side of PD that contains A. Therefore TP'D' does 
not intersect AB. 


Now let 
K' = (r[P/D? intersects AB}, 


as in the definition of the critical angle, so that 
c(a’) = sup K’. 


Then c(a) is an upper bound of K’, because P'D’ does not intersect AB. And 
c(a’) is the least upper bound of K’. Therefore c(a’) = c(a), which was to be 
roved. 

5 In the Euclidean case, this theorem cannot be strengthened to give the strict 
inequality c(a’) « c(a) for a’ > a. (The reason, obviously, is that in Euclidean 
geometry we have c(a) — 90 for every a.) In hyperbolic geometry, however, not 
only do we have the strict inequality but we actually have c(a) — 0 as a — oo. 
(It would be worthwhile to figure out how this works in the Poincaré model de- 
scribed in Chapter 9.) 
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Theorem 4 allows the possibility that c(a) < 90 when a is large, but c(a) = 90 
when a is sufficiently small. But in fact this cannot happen, as the following two 
theorems show. 


Theorem 5. If c(a) < 90, then c(a/2) < 90. 
P 


| MIS 
Aa 
€ 
$ 
> 


4 
na ie 
i 
y 

| 

| 
SI 

I 

l 

I 

[j 

l 

i 


FicunE 24.10 


Proof. Given P, P' as in the figures, with AP = a, AP’ = a/2. Take PD so 
that mZAPD = c(a) < 90, and take PÈ L AP at P’. If PB fails to inter- 
sect P'E, as on the left, then there is an acute angle Z AP’D’ such that P’D’ fails to 
intersect AB. (Proof ?) Therefore c(a/2) S c(a) « 90. 

Suppose, then, that PD does intersect P'É, at a point F. (It will turn out, later 


in the theory, that this is what always happens.) Let G be any point such that 
P-F-G. Then Z AP'G is acute. 


Now (1) AB cannot intersect P'G except perhaps in a point of P’G; the reason 
is that all other points of P'G lie on the “wrong side" of PẸ. And (2) AB does not 
contain P’ or G. Finally (3) AB does not contain a point between P’ and G; if so, 


it would follow from the postulate of Pasch that AB intersects P’F or FG, which 
is false. 


Therefore P/G does not intersect AB, and c(a/2) < 90, which was to be proved. 


Theorem 6. If c(ag) < 90 for some ao, then c(a) < 90 for every a. 
Proof. For each n, let 
an 


Il 
"js 


By induction based on Theorem 5, we have 


c(a.) < 90. for every n. 
Suppose now that c(b) — 90 for some b. Since 


lim a, = 0, 
nn 


we have 
a; <b for some k. 


Thus a, < b but c(a) < c(b), and this contradicts Theorem 4. 


% | ABSOLUTE GEOMETRY: OPEN TRIANGLES, PARALLEL RAYS = 31! 
| This theorem clarifies the meaning of the parallel postulate; it tells us that the 
situation described in the postulate holds either always or never. 


Theorem 7. The All-or-None Theorem. If parallels are unique for one line and 
one externa! point, then parallels are unique for all lines and all external points. 


| Proof. Given P, L, P’, and L', with AP = a and A'P' — a', as in the figure. 
à A 
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Figure 24.11 


It is now easy to see that each of the statements below is equivalent to the next. 


(1) There is only one parallel to L through P. 

(2) c(a) = 90. 

(3) c(a') = 90. 

(4) There is only one parallel to L’ through P”. 

Therefore (1) and (4) are equivalent, which was to be proved. 

Thus we can state our two possible- parallel postulates in seemingly weak but 
actually quite adequate forms. 


(I) Euclidean. For some line and some external point, parallels are unique. 
(II) Lobachevskian. For some line and some external point, parallels are not 


unique. 

We have already remarked, in Chapter 9, that the same all-or-none principle 
applies in other connections. If the formula mZ A 4- mZ.B + mZC = 180 holds 
for even one triangle, then it holds for all triangles; if even one pair of triangles are 
similar without being congruent, then the geometry is Euclidean; and so on. 


24.2 ABSOLUTE GEOMETRY: OPEN TRIANGLES AND CRITICALLY PARALLEL 
RAYS 
— — — : 
Given rays AB, PD and the segment AP, no two of | these figures being col- 
linear. Suppose that B and D are on the same side of AP, and that AB || PD. 


Then PD UPAU AB is called an open triangle, and is denoted by ADPAB. 
P D 


A B 


Figure 24.12 
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Here, when we write AB Il PD, we mean that the lines are parallel in the usual 
sense of not intersecting one another. 

Suppose now that A DPAB is an open triangle, and every interior ray of L APD 
intersects AB 


n D 


B 


> 


Figure 24.13 


We then say that PDis critically parallel to AD, and we write PD|AD. Here the 


single vertical stroke is supposed to suggest that PD is parallel to AD with no 
room to spare. | 


Note that PD and AB do not appear symmetrically i in this definition. Thus if 
PD|AB, it does not immediately follow that ABI|PD. Note also that the relation 


PD|AB (as we have defined it) depends not only on the "directions" of the two 
rays, but also on the initial points: 


A’? A A’? B 
Figure 24.14 


META N -o 

Thus, if PD|AB, we cannot conclude immediately that P'DJA'B. We shall see, 
however, in the next few theorems, that the above conclusions follow, non- 
immediately. 


Theorem 1. If PD|AB, and C-P-D, then CD| AB. 


c P 


D 


A B 
Figure 24.15 


Proof. Let CE be an interior ray of Z ACD, and suppose that CE does not in- 
tersect AB. By the exterior angle theorem (which was, fortunately, proved with- 
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out the use of the parallel postulate), we know that ZAPD > ZACD. Therefore 
there is an interior ray PF of ZAPD such that ZDPF œ ZDCE. Therefore 
Pr Il GÈ. Therefore PF does not intersect AB, because these rays lie on opposite 
sides of È. This contradicts the hypothesis PD|AB. 


—,.—À ——À 
Theorem 2. It PD|AB, and P-C-D, then CD|AB. 


Ficure 24.16 


We give the proof briefly. Suppose that there is an interior Tay CE of ZACD 
such that CE does not intersect AB. Let F be any point of CH — C, and take 
G so that P-F-G. Then 

(1) F is in the interior of ZAPC; 

(2) PF does not intersect AB; 

—, — 

(3) FG does not intersect AB ; 

— 
(4) PF does not intersect AB. 


—, — 
Statements (1) and (4) contradict the hypothesis PD|AB. 
Two rays R and R’ are called equivalent if one of them contains the other. We 
then write R ~ R’. Obviously the symbol ~ represents an equivalence relation. 
Fitting together the preceding two theorems, we get: 


— : 272 
Theorem 3. If R|AB, and R and R' are equivalent, then R’|AB. 


Somewhat easier proofs show that the relation PD|AB depends only on the 
equivalence class of AB. We leave these proofs to you. 


Theorem 4. If R;|Ro, Ri ~ Rı and R} ~ Ro, then Ri| R2. 


> 
Given PDIAB, let C be the foot of the perpendicular from P to AB, and let 
PC = a. 
D 


c(a)? 
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[ 
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| 
A Cc B 
Fieure 24.17 
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Then PD|CB (providing, of course, that B is chosen so that A-C-B, as in the 

figure). Therefore mZCPD = c(a). Now on the side of PC that contains B 
— 

there is only one ray PD for which mZCPD = c(a). Thus we have: 


Theorem 5. The critical parallel to a given ray, through a given external point, 
is unique. 


Two open triangles are called equivalent if the rays that form their sides are 
equivalent. An open triangle ADPAB is called isosceles if ZP S Z A. 


d 
?- D 


A B 
FicunE 24.18 


Theorem 6. If PD|AB, then ADPAB is equivalent to an isosceles open tri- 
angle which has P as a vertex. 


Figure 24.19 


Proof. Since PD|AB, the bisecting ray of Z APD intersects AB in a point Q. 
By the crossbar theorem, the bisecting ray of Z PAB intersects PQ at a point R. 


Let S, T, and U be the feet of the perpendiculars from R to PD, AB and AP. 
Then RU = RT and RU = RS. Therefore RS — RT, and ZRST = ZRTS. 
Hence (by addition or subtraction) Z DST œ ZBTS; and ADSTB is isosceles. 


To make P a vertex, we take V on the ray opposite to TB, such that 7V = SP. 


r 
S 


F. Z 
FicvnE 24.20 
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Theorem 7.. Critical parallelism is a symmetric relation. That is, if PDI|AB, 
then AB|PD. 


Proof. By Theorems 4 and 6, we may suppose that ADPAB is an isosceles open 
triangle: 


P D 


FIGURE 24.21 


Let AE be any interior ray of ZPAB. Let PF be p an interior ray of Z APD, 
such that Z DPF œ ZBAE. Then PF intersects AB at a point Q. It follows 
that AÈ intersects PD at the point S where PS = AQ. (Proof? The whole 
figure is symmetric, from top to bottom.) 


Theorem 8. If two nonequivalent rays are parallel to a third ray, then they are 

parallel to each other. 

RESTATEMENT. If ABICD, CDIEF, and AB and EF are not equivalent, then 

AB[EF. 

Proof. (1) Suppose that AB and EF lie on opposite sides of CD. Then AE 
intersects CD, and by Theorem 4 we can assume that the point of intersection is C. 


FIGURE 24.22 


Let AG be any interior ray of ZEAB. Then AG intersects CD at a point H. 
Take I so that C-H-I and take J so that A-H-J. Then HI|EF, by Theorem 4; 
and HJ is an interior ray of ZHEF. Therefore HJ intersects EF at a point K. 


Therefore AG intersects EF, which was to be proved. 
(2) It CD and Ef are on opposite sides of AB, then the same conclusion follows. 
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Figure 24,23 


Here we may suppose that AB n EC = A, for the same reasons as in the first 
case. Through E there is exactly one ray EF critically parallel to AB. By the 
e. —À— L— — 
result in Case (1), EF'|CD. Since critical parallels are unique, LF’ = EF and 

EF|AB, which was to be proved. 

There remains a sticky point which some authors have overlooked. How do 
we know that some two of our three rays lie on opposite sides of the line contain- 
ing the third? In the Euclidean case, this is easy to see, because any three parallel 
lines have a common transversal; in fact, any line which crosses one of them must 
cross the other two. But this is far from being the case in hyperbolic geometry, as 
an examination of the Poincaré model will easily show. Given three noninter- 
secting lines, it can easily happen that every two of them are on the same side of 
the third. Therefore the conditions AB || CD, OD || EF are not enough for our 
purpose ; to get a valid proof, we need to use the full force of the hypothesis ABICD, 


— 


CD|EF. We shall show, under these conditions, that (3) some line intersects all 
three of the rays AB, CD, EF. (Surely this will be adequate.) 


Figure 24.24 


If A and E are on opposite sides of CD, then AB intersects CD and (3) follows. 
Suppose, then, that (a) A and E are on the same side of CD. If A and D are on the 
same side of EC, then CA is an interior ray of ZC, so that CA intersects EP and 
(3) follows. If A lies on Ch, then (3) holds. We may therefore suppose that (b) A 
and D are on opposite sides of CE. Therefore AD intersects CE at a point G. 

Take H so that C-D-H. Then DH|AB. By the exterior angle theorem, 
“HDA > ZC. Therefore there is an interior ray DI of ZHDI such that 
£HDI 2 ZC. Then DI | CÉ, but Di intersects AB at a point J. 
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Now CÉ intersects AD at G. Therefore CE intersects another side of AADJ. 
Since CE does not intersect Di, CE intersects AJ at a point K. Now (3) follows; 
the line that we wanted is CE. 

The oversight leading to the incomplete proof of this theorem is rather illus- 
trious. It is due originally to Gauss, and has been faithfully reproduced by good 


authors ever since. 
PROBLEM Ser 24.2 


1. By the interior of an open triangle A DP AB, we mean the intersection of the interiors 
of ZP and ŻA. Ifa line intersects the interior of an open triangle, does it follow that the 
line intersects one of the sides? Why or why not? 

2. Same question, for the case where PD|AB. 


24.3 HYPERBOLIC GEOMETRY: CLOSED TRIANGLES AND ANGLE SUMS 


So far in this chapter, we have been doing absolute geometry. To mention the 
hyperbolie parallel postulate in our proofs would have been misleading, because in 
the Euclidean case, our theorems, so far, become not false but merely trivial, and 
the difference between falsity and triviality is important. 

In this section we deal specifically with the hyperbolic case. To avoid confusion, 
throughout this chapter, we shall mention the hyperbolic parallel postulate in every 
theorem whose proof requires it. We shall abbreviate the name of the postulate 


as HPP. 
P 


A B 


Figure 24.25 


1t PD|AB, then ADPAB is called a closed triangle. 

Note that every closed triangle is an open triangle, but the converse is false, 
because through P there is more than one line parallel to "AB. Closed triangles have 
important properties in common with genuine triangles. 


Theorem 1. The Exterior Angle Theorem. Under HPP, in every closed triangle, 
each exterior angle is greater than its remote interior angle. 


— - 
Restatement. If PD|AB and Q-A-B, then ANAP SLE. 
P 
D 


Q A B 
Fieure 24.26 
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P D 
Q 4 E 
Figure 24.27 


Proof. If ADPAB is isosceles, this is obvious. Here, if HPP holds, then ZP 
and ZPAB are acute (because c(a) « 90 for every a), and therefore ZQAP is 
obtuse. 

Suppose then that ADPAB is not isosceles. By Theorem 6, Section 24.2, 
ADPAB is equivalent to an isosceles open triangle A DPCB, and this open triangle 
is also closed: 


If C = A, there is nothing to prove. For the case A-C-B, let the degree measures 
of the various angles be as in the figure. Then 


por, 
because c(a) < 90. And 
p+q+s S 180, 


by Theorem 6, Section 10.4. Therefore 


t = 180-—qept+sr>r+s, 
and 
t> r+ 8, 


which proves half of our theorem. 
To prove the other half, we need to show that u > q. This follows from 


t = 180 — q > 180 — u =r +s. 
We found, in Theorem 3, Section 24.1, that the critical function c was nonincreas- 


ing. That is, if a’ > a, then c(a’) S c(a). Using the exterior angle theorem, we 
can sharpen this result. 
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FIGURE 24.29 


Theorem 2. Under HPP, the critical function is strictly decreasing. That is, if 
a’ > a, then c(a") < c(a). 
— — ——> > 
. Proof. In the figure, AP = a and AP’ = a’, PD|AB and P'D'|AB, so that 
PD|P'D'. Therefore AD'P'PD is a closed triangle. Therefore c(a) > c(a’), which 
was to be proved. 


Theorem 3. Under HPP, the upper base angles of a Saccheri quadrilateral are 
always acute. : 


Ficure 24.30 


(We already know, from Chapter 10, that they are congruent, and cannot be 
obtuse.) a} "y. 
In the figure, BQ and GP are the critical parallels to AD, through A and C. 


Therefore 
mZ ABQ = c(a) = mZDCP, 


as indicated. Applying the exterior angle theorem to the closed triangle APCBQ, 
we see that 
08. 
Therefore 
t+ cla) > s+ cla). 
Therefore 
s + c(a) < 90, 
which proves our theorem. 
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FIGURE 24.31 


Theorem 4. Under HPP, in every right triangle A ABC, we have 
mZA+mZB+mZC < 180. 


Proof. Suppose not. Then, if ZA is the right angle, ZB and ZC must be 
complementary. Take D on the opposite side of BÓ from A, so that ZBCD = 
ZABC and CD = AB. Then AABC = ADCB, by SAS; and [ JABDC is a 
Saccheri quadrilateral. This is impossible, because Z D is a right angle. 


Theorem 5. Under HPP, for every triangle A ABC, we have 
mZA+mZB+mZC < 180. 


Figure 24.32 


Proof. Let AC be a longest side of A ABC, and let BD be the altitude from B 
to AC. Then 
r d-s-- 90 < 180, 
and 
t+u-+90 < 180. 
Therefore 
r+ (s+ t)+u < 180, 


which proves the theorem. 

Soon we shall see that under HPP this theorem has a true converse: for every 
number z < 180 there is a triangle for which the angle sum is z. Thus 180 is not 
merely an upper bound for the angle sums of triangles, but is precisely their su- 
premum. 
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24.4 HYPERBOLIC GEOMETRY: THE DEFECT OF A TRIANGLE AND THE COLLAPSE 
OF SIMILARITY THEORY 


The defect of A ABC is defined to be 
180 — mZA — mZB — mZC. 


The defect of AABC is denoted by 4^ ABC. Under HPP we know that the defect 
of any triangle is positive, and obviously it is less than 180. (Later we shall see 
that the converse holds: every number between 0 and 180 is the defect of some 
triangle.) 

The following theorem is easy to check, regardless of HPP. 


Theorem 1. Given A ABC, with B-D-C. Then 
5A^ABC = 6AABD + SADC. 


A 


B D c 
Figure 24.33 


It has, however, an important consequence. 


Theorem 2. Under HPP, every similarity is a congruence. That is, if AABC ~ 
ADEF, then AABC = ADEF. 


FiaunE 24.34 


First we take G on AB so that AG = DE; and we take H on AC so that AH = 
DF. We then have AAGH & AEDF, by SAS; therefore 


AAGH ~ AABC. 


If G = B, then H = C, and the theorem follows. We shall show that the contrary 
assumption G # B, H # C (as shown in the figure) leads to a contradiction. 
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Let the defects of AAGH, AGHC, and AGBC be di, do, and da, as indicated 
in the figure; let d be the defect of A ABC. By two applications of the preceding 
theorem, we have d = d, + dy + da. This is impossible, because the angle con- 
gruences given by the similarity AABC ~ AAGH tell us that d — di. 

The additivity of the defect, described in Theorem 1, gives us more information 
about the critical function c, What we know so far is that (1) 0 < c(a) < 90 for 
every a > 0, and (2) c decreases as a increases. There remains the question of how 
small the numbers c(a) eventually become when a is very large. We might have 
either of the following situations: 


Y 


90 y-c(x) 


e=0 


Figure 24.35 


In each case, e = inf {c(a)}, that is, the greatest lower bound of the numbers c(a). 
In each case, it follows from (2) that lim, ,4 c(a) = e. To prove the following 
theorem, therefore, we need merely show that e > 0 is impossible. 


Theorem 3. lima,» c(a) = 0. 
Proof. Suppose that c(a) > e > 0 for every a. 


Fieure 24,36 


The markings in the figure should be self-explanatory. For each n, P,Q, inter- 
—> 
sects Pol, because e < e(n). The right triangles A P,P, ,Q, ,, are all congruent, 
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Figure 24.37 


and therefore have the same defect dg. Consider now what happens to the defect 
d, of APSP,R, when n is increased by 1. The letters in the interiors of the tri- 
angles denote their defects. We have 


5bAP oPpRa+1 = dn + Y, 
bAPn4i1PaARn41 = do + 2, 
An 41 = (dn ate y) Lc (do "E 2); 


by Theorem 1 in each case. Therefore 


dn41 > dn + do. 
Thus 
dg > di-Fdo, ds > d2 + do > di + 2doj 


and by induction, we have 
dy > dı + (n "T 1)do. 


When n is sufficiently large, we have d, > 180, by the Archimedean postulate. 
This is impossible, because the defect of a triangle is 180 minus the angle sum. 
Therefore c(a) > e > 0 is impossible, which was to be proved. 

Consider now what happens to the measure r(a) of the base angles of an isosceles 
right triangle, as the length a of the legs becomes large. 


Ficure 24.38 
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r(a)° 
D a A a [el 


Figure 24.39 


Here BDIAC. Therefore we always have r(a) < c(a). Therefore lima,» r(a) = 0. 
Let us now make the figure symmetrical by copying A ABC on the other side of 
AB. For ADBC, the angle sum is 4r(a). Therefore the defect 180 — 4r(a) can 
be made as close to 180 as we please; we merely need to take a sufficiently large. 
Thus 180 is not merely an upper bound of the numbers which are the defects of 
triangles; 180 is precisely their supremum. 


Theorem 4. For every number z < 180 there is a triangle whose defect is greater 
than z. : 


24.5 ABSOLUTE GEOMETRY: TRIANGULATIONS AND SUBDIVISIONS 


Let R be a polygonal region. As in Chapter 14, by a triangulation of R we mean 
a finite collection, 
K = ATG Ts. US, 


of triangular regions 7’;, such that (1) the T's intersect only at edges and vertices, 

and (2) their union is R. A collection K which satisfies (1) is called a complex. 

Evidently every complex K forms a triangulation of the union of its elements. 
Given two complexes 


K= aq, Tz, tet) T); 
KATY, T$, ood Th), 


If every T; lies in some one of the sets T;, then K’ is called a subdivision of K. 
Triangulations of certain types will be especially useful. 

Given a polygon with vertices P}, P2,..., Pn. Suppose that for each pair of 
successive vertices P;, P;,; all other vertices of the polygon lie on the same side 


Ficure 24.40 


A "a - 
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of PP Then the polygon is conver. From this it follows that if P; P;,1, 
P;,2 are successive, then the other vertices (if any) all lie in the interior of 
ZP;P;i41Pi+2. (To get this, we merely apply the definition of the interior of an 
angle.) By the interior of a convex polygon, we mean the intersection of the interiors 
of its angles. By a convex polygonal region, we mean the union of a convex polygon 
and its interior. By a star triangulation of such a region, we mean a complex like 
this: 


Figure 24.41 


(More precise definition?) Obviously every convex polygonal region has a star 
triangulation; and any point P of the interior can be used as the central vertex. 


Theorem 1. Let R be a convex polygonal region, and let L be any line which 
intersects the interior of R. Then L decomposes R into two convex polygonal 
regions. 


Figure 24.42 


Proof. Let H, and H be the half planes with L as edge; let Hı = H, U L and 
let Hp = Ha U L. Let 


Rı= RnH, R= RNH. 


Then R, and Ro are convex sets because each of them is the intersection of two 
convex sets; and it is easy to check that they are convex polygonal regions in the 
sense that we have just defined. 

We shall use this as a lemma in proving the following theorem. 


Theorem 2. Every two triangulations of the same polygonal region have a com- 
mon subdivision. 
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That is, if Ki and Kg are triangulations of R, then there is a triangulation K 
of R which is a subdivision both of K, and Ko. 


LENS 


Figure 24.43 
In the figure, the edges of K, are drawn solid, and the edges of K are dashed. 
(As in Chapter 14, an adequate figure is not easy to draw or to look at.) 
Proof. Let 
id LiLs..., Ls 


be the lines which contain either an edge of Kı or an edge of Ky. (For Fig. 24.43, 


we would have n — 9.) 
Now Lı decomposes each T; in K, (and each T; in K3) into two convex regions, 


if Lı intersects the interiors of these sets at all: 


[SS 


\ 
A 
FicunE 24.44 


(In the figure, we show a possibility for L1.) By induction it now follows that the 
union of all of the L;'s decomposes R into a finite collection, 


C= {C1,Ca,..., Cr}, 


of convex polygonal regions, like this: 


LES 


Figure 24.45 


Evidently every C; lies in some one T; € K, and in some one T; € Ky. Vor each 
C; we take a star triangulation. These fit together to give our common sub- 
division K. 
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Let R, and Rz be polygonal regions. Suppose that they have triangulations 
Ki = (Ty To, ..., Tn}, 
Ky 5 UM Tay ne Eads 


such that for each 7 we have 
T; z Ti. 


Then we say that Rı and Rz are equivalent by finite decomposition, and we write 
Ri = Ro. 


Here by T; = Ti we mean that the corresponding triangles are congruent in the 
elementary sense. In fact, this is equivalent to saying that 7; and Ti are isometric 
in the sense of Chapter 17. 

Equivalence by finite decomposition is a familiar idea in simple cases; it means, 
intuitively, that you can cut E, into little triangular pieces, with scissors, and then 
put the pieces back together again, usually in a different way, to get Re. 


AUS 


FIGURE 24.46 


For example, in the figure above, we have T, = T4. Therefore 
T, U T2 U T; = T4 U T2 U Ts. 


In fact, this is the observation that people usually make to infer that the parallelo- 
gram and the rectangle have the same area. 


Theorem 3. Equivalence by finite decomposition is an equivalence relation. 


Proof. Trivially, = is reflexive and symmetric. We must now show that if 
Rı = Rz and Rz = Rs, then R; = Rs. 
Let K, and Ky be the triargulations used in exhibiting that Rı = Ro. That is, 


Ky = (7,, Ts», ttg Toys 
Kam UD T5. Th 
with i 
T; z TA. 


Let K} and K be the triangulations exhibiting that Re = R3. Let K be a common 
subdivision of Kz and K; (Fig. 24.47). Given any T; € K;, we observe that the 
corresponding T; € Kz has been subdivided in a certain way. We copy this sub- 
division scheme in T';, following the congruence T; = Ti backwards. This gives a 
subdivision K{ of K;, shown by the dotted lines in the figure. Similarly, we get 
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eo 


= 


K 
FIGURE 24.47 


a subdivision K3 of K3. We can now match up the elements of K1 and K4 in such 
a way as to show that E, = R3. (We omit the details, on the ground that a careful 
inspection of the figures is likely to convey the idea adequately and more easily.) 


24.6 EUCLIDEAN GEOMETRY: BOLYAI'S THEOREM 


If two triangular regions are equivalent by finite decomposition, then surely they 
have the same area. Thus if 
Ry NTa O UT. 
Rz = Ti U T5U...U Th 


T;z T; 


for each 7, then we have aT; = aT, from which we get aR; = ah, by addition. 
It was discovered by Bolyai that the converse is also true; if aR; = akg, it 

follows that Rı = Ry. This section will be devoted to the proof of Bolyai’s theorem. 
Given AABC, with BC considered to be the base: 


with 


Figure 24.48 


Let D and E be the midpoints of AB and AC; let L = DE; and let BF, AG, and 
CH be perpendicular to L. Then [ ]BFHC is a rectangle. (Remember the SAA 
Theorem.) We shall call ()BFHC the rectangle associated with A ABC. (Of course, 
[ ]BFHC depends on the choice of the base, but it will always be clear which base 
is meant.) 
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Theorem 1. Every triangular region is equivalent by finite decomposition to its 
associated rectangular region. 


Proof. For the case shown in Fig. 24.48, this follows from two triangle congru- 
ences. The proof in the general case is indicated by the figures below: 


(a) R,UR,=R,UR; B C 


Figure 24.49 
© T,uT,=T,UT; 


In Fig. 24.49(a), Ri = Ra. Therefore AABC = LJBDGC. Thus we need to 
show that LIBDGC = LIBFHC. If F lies on DG, as in Fig. 24.49(b), this follows 
from the fact that ADFB = AGHC. If not, we can reduce the theorem to this 
case by repeated applications of the theorem conveyed by Fig. 24.49(c). 


Theorem 2. If two triangular regions have the same base and the same area, 

then they are equivalent under finite decomposition. 

Proof. Let the triangular regions be T and 7", and let the associated rectangular 
regions be R and R’. Then R and R’ have the same base b. Since aR = aT = 
aT’ = ak’, we have aR = ak’, and R and R’ have the same altitude. Therefore 
R = R'. (Proof?) Thus 
T=R=R' =T, 
and T = T’, which was to be proved. 


Theorem 3. Bolyai’s Theorem. If two triangular regions have the same area, 
they are equivalent under finite decomposition. 
K A 


U I 
E ! 
Fıcure 24.50 OS 
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Proof. Given T;, T2, with oT; = aT2. Suppose that 7, is the union of A ABC 
and its interior. If a side of T, is congruent to a side of To, then T; = T; by 
Theorem 2. Suppose, then, that T? has a side of length a > AC. If we prove the 
theorem for this case, it will follow in general merely by a change of notation. 
As before, let L be the line through the bisectors D and E of AB and AC. Let J 
be a point of L such that CJ = a/2. [Query: How do we know that there is such a 
point?] Now take K so that C-J-K and JK = a/2. Let T; be the union of AKBC 
and its interior. By Theorem 2, we have 


T; z R and T= T; 


Therefore T; = R. Since we know already that # = T;, we have T, = To, which 
was to be proved. 

Note that the use of the transitivity of the velation has spared us some almost 
impossibly complicated figures, exhibiting the equivalence of 7; and 7's. 

We shall see that Bolyai's Theorem can readily be extended to polygonal regions 
in general. To show this, we use the following theorem. 


Theorem 4. In a Euclidean plane, every polygonal region is equivalent by finite 


decomposition to a triangular region. 


Proof. Given a polygonal region R, with a triangulation 
K — Us, Ts... T4). 
For each i, let 
Q; = aT;. 


There is now a complex which looks like this: 


A 
j Figure 24.51 
LI 

B, B, Bp B, 

If T; is the ith triangular region in this figure, then 
aT’, = aT;. 
Therefore 
T} = Ti 


for every i, by Theorem 3. Therefore R is equivalent by finite decomposition to the 
union of the regions T7. 

This is not a theorem of absolute geometry; it will turn out that under HPP the 
triangular regions are—in a certain sense—of bounded size, and the polygonal 
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regions in general are not. In the following section we shall elucidate this idea by 
showing the form that area theory takes in hyperbolic geometry. Meanwhile we 
generalize Bolyai’s theorem. 


Theorem 5. Let R and R’ be polygonal regions in a Euclidean plane. If aR = 
aR’, then R' = R. 


Proof. Let T and T" be triangular regions such that 
T=R, T'= R. 


Then aT = aR and aT’ = aR’. Therefore aT = aT’ and T = T'. Therefore 
R = R’, which was to be proved. 


24.7 HYPERBOLIC AREA-THEORY: THE DEFECT OF A POLYGONAL REGION 


We have already found, in Theorem 1, Section 24.4, that the defect of a triangle 
is additive in the same way that area is: 


A 
B D C 
FicvnE 24.52 


That is, if B-D-C, then the defect of AABC is the sum of the defects of AABD 
and AADC. This simple fact is the key to the development of an area theory in 
hyperbolic geometry. We begin by defining the area of a triangular region T to 
be the defect of the corresponding triangle. We denote the hyperbolic area by 
ôT, where à stands for defect, just as a stood for area. Under HPP, we know that 
ôT > 0 for every T. And we know that the additivity postulate holds, insofar 
as it applies at all. 

We would like to define our “area function” 6 more generally so as to make it 
apply to all polygonal regions. We shall do this in several stages. 

First, given à complex 

K= {Ti Tz- Tr}; 


we define 
6K = T, 5T2 = seep 9T,. 


We are now in the same situation as in Chapter 14. Every polygonal region R 
has infinitely many triangulations K; we would like to define à as 5K; but to do 
this we must first show that 5K depends only on R, and is independent of-the choice 
of the triangulation K. This is easy to see for star triangulations of a convex poly- 
gonal region. 
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Theorem 1. If Kı and K are star triangulations of the same polygonal region 
R, then ôK: = ôK». 


Figure 24.53 


The reason is that the total defect in each star triangulation is 
(n + 1)180 — (mZPo + mZP, +--+ mZP, + 360) 
= (n — 1)180 — (mZPo + mZP, +---+ mZP,). 


Here we are merely adding the measures of the angles at each vertex, and subtract- 
ing n(180). This theorem justifies the following definition. 


DzriNrTION. The defect ôR of a convex polygonal region R is the number which 
is the defect of every star triangulation of R. 


A border triangulation of a convex polygonal region is one which looks like 
this. (Exact definition?) 


P, 
Pz 
Pr 


P, 
Po 


Fieure 24.54 


Theorem 2. The defect of a border triangulation is the same as the defect of the 
region. 


The reason is that the defect of the border triangulation is 
n(180) — (mZPy, + mZP, +--+ mZ P, + 180), 


which gives the same answer as for star triangulations. 


Theorem 3. If a convex polygonal region is decomposed by a line into two such 
regions, then the defect of the union is the sum of the defects. 
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R, 


R 
Figure 24.55 


Proof. Given Rı U Re = R, Ry N Ra C L, asin the figure. Let P be any point 
of L in the interior of R; let Kı and K3 be the border triangulations in which P is 
the extra vertex. Let K = K, U Kz. Then, trivially, we have 


ôK = 5K, + 6K. 
Since 6K = ôR, 6K, = 5R, and 6K = 6Rz, this proves the theorem. 


Theorem 4. If K, and K; are triangulations of the same polygonal region R, 
then 5K, = 6K. 


The proof is very similar indeed to the proof of Theorem 2, Section 24.5. Exactly 


as in that proof, we let 
Li, Lo, ..-, Ln 


be the lines that contain either an edge of K, or an edge of Ky. As before, we use 
the lines L;, one at a time, to cut up the triangular regions in K, and K3 into smaller 
convex polygonal regions. At each stage, we know by Theorem 3 that the total 
defect is unchanged. When all the lines have been used, we form a star triangulation 
of each of the resulting convex polygonal regions; this final step also leaves the total 
defect unchanged. (In fact, the defect of a convex polygonal region C was defined 
to be the total defect in any star triangulation.) We now have a common subdivi- 
sion K of K, and Kg, with . 


and 


Therefore 6K; = 6K», which was to be proved. 


DzrixiTION. The area ôR of a polygonal region R is the number which is the 

defect ôK of every triangulation K of R. 

It is not hard to see that our area function 6 satisfies the postulates A-1 through 
A-5 of Section 13.1. (You may wonder, at first, about A-5, which says that the area 
of a rectangular region is the product of its base and its altitude. But under HPP, 
even this last postulate holds, for the rather odd reason that there are no rectangular 
regions to which it can be applied.) 
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24.8 BOLYAI'S THEOREM FOR TRIANGLES IN THE HYPERBOLIC CASE 


We shall see that Bolyai’s theorem holds under HPP. That is, if àR, = 6Ro, 
then it follows that Rı = Rə. Although a part of the proof follows the lines of 
Section 24.6, the technique is more complicated, so we shall need some preliminaries. 


Theorem 1. If two Saccheri quadrilaterals have the same upper base and the 
same defect, then their upper base angles are congruent. 


Restatement. Let [JABCD and [JA’B’C'D’ be Saccheri quadrilaterals (with 
right angles at A, D, A’, and D"). If BC = B'C' and 6 ]ABCD = 6_]A’B’C'D’, 
then ZB = ZB’ and ZC & ZC’. 


B Ç 
A D 
Figure 24.56 


Proof. It is not hard to calculate that 
à JABCD = 180 — (mZB + mZC) = 180 — 2mZB. 
Therefore mZB is determined by the defect; and from this the theorem follows. 
Theorem 2. Under HPP, if [ABCD and [ ]A'B'C' D' are as in Theorem 1, then 
L]JABCD = []A'B'C'D'. 


(Here the indicated congruence means that the correspondence ABCD 
A'B'C' D' preserves lengths of sides and measures of angles.) 


Proof. Let E and F be points of BA and CD such that 
BE = CF = B'A’ = C'D’. 


Figure 24.57 
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By SAS, we have ABCF = AB'CD. Therefore BF = B'D'. By angle subtrac- 
tion, we have ZEBF œ ZA'B'D'. By SAS, we have ABEF = AB'A'D'. 
Therefore Z E is a right angle. In the same way, we conclude that ZCFE is a right 
angle. Thus we have 

LJEBCF = [ ]A'B'C'D'. 
If E = A and F = D, this proves the theorem. If not, [ ]EADF is a rectangle, 


which is absurd; there is no such thing as a rectangle. 
The reader is warned that, hereafter in this section, to draw our triangles right 


side up, we are going to > draw our Saccheri quadrilaterals upside down. 
Given A ABC, with BC considered to be the base: 


Figure 24.58 


As before, let D and E be the bisectors of AB and AC; let F, G, and H be the 
feet of the perpendiculars from B, A, and C to L. Asin the Euclidean case, we have 


AFBD = AGAD, 
AGAE = AHCE, 
FB = GA = HC. 
(The elementary theory of congruence is a part of absolute geometry.) Therefore 


[JHCBF is a Saccheri quadrilateral. We shall call it the quadrilateral associated 
with ABC. It depends on the choice of the base, but it will always be clear which 


base we mean. 
Theorem 3. Every triangular region is equivalent by finite decomposition to its 
associated quadrilateral region. 


The proof is exactly like the proof of Theorem 1, Section 24.6; this proof de- 
pended only on congruences and the SAA theorem. 


Theorem 4. Every triangular region has the same defect as its associated quad- 


rilateral region. 
Because the two are equivalent by finite decomposition. 


Theorem 5. If AABC and ADEF have the same defect and a pair of congruent 
sides, then the two triangular regions are equivalent by finite decomposition. 
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A D 
t K M N 
| | | ! 
| I [ l 
[ l 
B Ç E F 
Figure 24.59 


Proof. It does no harm to suppose that the congruent sides are the “bases” BC 
and EF. If the associated Saccheri quadrilaterals are as indicated, then by 
Theorem 4 they have the same defect. By Theorem 2, [ JKCBJ = [ ]NFEM. 
Let 7, T2, Ri, Re be the regions determined by our triangles and quadrilaterals, 
then Rı = R2. (This requires a proof based on our congruence, but the proof is 
immediate.) Since T, = Ry and Ry = To, by Theorem 3, we have T, = T^, 
which was to be proved. 


Theorem 6. Given A ABC, D, E, and L = DE, as in the definition of the asso- 
ciated quadrilateral. Let A’ be a point on the same side of BÓ as A. If L contains 
the bisector E' of AC, then L also contains the bisector D' of A'B, and AABC 
and AA’BC have the same associated quadrilateral. 


Fieure 24.60 


Proof. Let J and K be the bisectors of FH and BC. Then JK L BÓ and JK L 
DE, as indicated in the figure, Therefore 

(1) DË is the perpendicular, through Æ, to the perpendicular bisector of BC. 

Applying precisely the same reasoning to A A'BC, we get 

(2) D'E' is the perpendicular, through E', to the perpendicular bisector of BC. 


Since we know that Æ’ lies on DE, it follows that DE — DE, and D' lies on L. 
Therefore AABC and AA'BC have the same associated quadrilateral [ ]HCBF, 
which was to be proved. 


Theorem 7. If T, and T; are triangular regions, and 67, = 672, then T, = T;. 


E 


SE ee ae 


DEFECTS OF SMALL TRIANGLES 337 


Proof. Let the associated triangles be AABC and AA'B'C'.. 1f a side of one is 
congruent to a side of the other, then 7, = T2, by Theorem 5. If not, we may 


suppose that 
a= AU > AC, 


(This proof is going to be very similar to that of Theorem 5, Section 24.6.) 


Figure 24.61 


As indicated in the figure, let [JHCBF be the quadrilateral associated with 
AABC. Let J be a point of L such that CJ — a/2. Then take K so that C-J-K 
and JK — a/2. By Theorem 6, (CJHCBF is the quadrilateral associated with 


AKBC. 'Thus 
sAKBC = &[]HCBF = 5A ABC = &AA'B'C'. 


Now AKBC and AA'B'C' have a pair of congruent sides and the same defect. 
Therefore the corresponding regions T3, T are equivalent by finite decomposition. 
Let R be the region corresponding to [JHCBF. Then we have 


T,= R= T= T2. 


Therefore Tı = T», which was to be proved. 


24.9 DEFECTS OF SMALL TRIANGLES 


We shall show, in this section, that the defect of a triangle is as small as we 
please, if the triangle itself is sufficiently small. More precisely: 


Let e be any positive number. Then there is a positive number d 


Theorem 1. 
BC have length less than d, then SAABC < e. 


such that if all the sides of AA 


We proceed to the proof. We have given a positive number e. Consider a right 
angle ZP,PQ;, with P,P = PQ, = 1 (Fig. 24.62). For each n, take Pn and Qn, 
as indicated, so that PP, = PQn = 1/n. Thus we get à sequence of convex 
quadrilaterals CIP: P2Q291, [LIP2P3Q3Q2,.--- Let dn be the defect of the nth 
quadrilateral CIP nPn+1Qn+19n; and let dy = 6AP,PQ:. Then 


di +dg+-+::+dn < do 
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P ...Q,0;Q, Qi 
Figure 24.62 


for every n. Therefore the infinite series, 
di c dace diee, 


is convergent. "Therefore lim,.,4 d, = 0. Therefore d, < efor some n. 

The sole purpose of the above discussion was to demonstrate the following 
statement. 

There is a convex quadrilateral [-]PQRS, of defect less than e. 


R 


8OPQRS «e 
Fieure 24.63 


Let T be any point between P and S. Let hy, ha, ha be the perpendicular distances 
from T to the other three sides of CIPQRS, and let d be the smallest of the numbers 
hi, ha, ha. 

It is now easy to check that d is a number of the sort that we wanted. Given any 
triangle A ABC, with sides of length less than d: 


Q R 
B 
Ns hrs 
P T=A' CS A G 
Figure 24.64 


By SAS, we can construct a congruent copy AA'B'C' of AABC in the quadri- 
lateral region, with T = A’. The copy really will lie inside the quadrilateral, 
because its sides are too short to reach the other three sides of the quadrilateral. 


^ 
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But now we are done, because 
5A ABC = 5AA'B'C' < e ]PQRS < e. 


This theorem tells us that it may be very hard to tell the difference between a 
hyperbolic plane and a Euclidean plane, if you are allowed to inspect only a small 
portion of it. One of the possibilities for physical space is that planes are hyper- 
bolic, but that the portions of them that we can examine from the earth are "small," 
so that the deviation from the Euclidean angle-sum formula mZ A + mZB + 
mZC = 180 is too small to be detected, for every triangle small enough for us to 
test. C. F. Gauss made a test of this sort, using the peaks of three neighboring 
mountains as the vertices of his triangle. He was unable to observe a deviation 
from the Euclidean formula, but obviously it is possible that his mountains were 


too neighboring. 


24.10 THE CONTINUITY OF THE DEFECT 


Given AABC and ADEF, with sAABC > 6ADEF. (Note that in the light 
of the results in the previous section, a plausible figure should make the second 


triangle look smaller.) 
B 
2s ~~ 
A P, C D F 


FicunE 24.65 
For 0 < r € mZB, let P, be a point of AC such that mZ ABP, = r. (By 
the cross-bar theorem, for each such r there is a point P,.) We know that 


3A ABP, + 6AP,BC = 8AABC. 


For0 <r € mZB, let 


f(r) = BA ABP,. 


As a definition of f(0), we provide further that 
f(0) = 0. 


(Of course, the definition is reasonable: in effect, we are defining the defect of a seg- 


ment to be 0.) : x! 
[t is easy to see that f is a strictly increasing function: ifr < s, then we have 


f(s) = f(r) + $AP.BP,, 


ih fr) «fo. 
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A P. Ps [^ 
Figure 24.66 


It is reasonable to suppose that f is continuous. If this is not merely reasonable 
but also true, then we will have the following theorem. 


Theorem 1. If 5A ABC > 5ADEF, then there is a point P between A and C 
such that 58A ABP = 5ADEF. 


The reason is this. The graph of y = f(r) looks like this: 


FiaunE 24.67 


Being continuous, f takes on every value between its initial value 0 and its final 
value àA^A BC. Therefore f(r) = ôADEF for some F. Let P = P;. 

We shall need this result to generalize Bolyai's theorem to arbitrary polygonal 
regions in the hyperbolic case. We therefore complete its proof. 


lemma. fis continuous. 


Given 0 S k € mZ B, we need to show that 
lim f(r) = f(b). 


By definition, this means that the following condition holds. 
(1) Given 0 S k € mZB. For every e > 0 there is a d > 0 such that if 


|r "T k| <d, 
then 
lf(r) — f(k)| < e. 


Let us interpret this geometrically in terms of our definition of f. 
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B 


| 
A 
A Pi P. 
Figure 24.68 


We have 
|r — k| = mZP,BP;. 


(Note that if r < k, which it may be, we need the absolute value signs to make 
this formula correct.) Also 
uem) — sK) |BAABP, — ôA ABP, 


= 5AP,BP;. 


I 


(Here again the absolute value signs are needed to take care of the possibility 
r < k) 
In these terms, Condition (1) takes the following form. 
(2 Given 0 S k € mZB. For every e > 0 there is a d » 0 such that 
mZ P,BP, < d, 


then 
oAP,BP, < €. 


We shall prove (2). First let us suppose that BC > BA, as the figures suggest. 


B 
d^ 


A P? C~ E F 
FıcureE 24.69 


If A-C-E, then BP < BE for every point P of AG. (Proof?) We assert that there 


are points E and F such that A-C-E, A-C-F and 
SAEBF < e. 


The reader should be able to produce a proof, following a scheme suggested by the 


proof of an analogous result in the preceding section. Given such an E, F, we let 


d = mZEBF. 
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——— Á Á— 


B 


A Pk P, C B F 
Figure 24.70 
This is a number d of the sort that we wanted. The point is that if mZP,BP, < 
mZ EBF, then AEBF contains a congruent copy AP,BP; of AP,BP,. Evidently, 
dAP,BP, = 5APLBPY < &AEBF. 
Thus for mZP,BP, < d, we have 9AP,BP, < e, which was to be proved. 


24.11 BOLYAI'S THEOREM FOR POLYGONAL REGIONS IN THE HYPERBOLIC 
CASE 


Theorem 1. Under HPP, if two polygonal regions have the same area, then they 
are equivalent by finite decomposition. 

That is, if 62; = ôR}, then Ry = Ro. 

To prove this, we take any triangulations K 1, K2 of R, and R3: 


E. 
AN 


Figure 24.71 


Some one triangular region in one of these complexes must have minimum defect. 
That is, some T in either K, or K, must have the property that ôT < ôT” for 
every 7" in K, or Kg. Suppose that this is T; € Ki. Let Ti € K;. If it happens 
that 67, = ôTi, we delete T, from Kı and delete T{ from K}. This gives new 
complexes Ki, K2, with fewer elements than K 1 and Ky. Let R{, R} be the cor- 
responding regions. If R{ = R4, then Rı = Rs. 

If T, < 67}, then we know by Theorem 1, Section 24.10, that T1 can be sub- 
divided into two triangular regions U, V, such that 


êV = 87). 


We now delete 7, from K,, and we replace T; by U in K3. If the resulting regions 
are equivalent by finite decomposition, then so also are R, and R3. 
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Thus, in either case we can reduce our theorem to a case in which the total 
namber of triangular regions is smaller than it was to start with. In a finite number 
of such steps, we can reduce the theorem to the case of two triangular regions, for 
which the theorem is known to be true. 


24.12 THE IMPOSSIBILITY OF EUCLIDEAN AREA-THEORY IN HYPERBOLIC 
GEOMETRY 

To define the area of a polygonal region as the number which is the total defect 
of each of its triangulations may seem to be a peculiar proceeding. We shall show, 
however, that this peculiarity is inevitable. Under HPP, it is impossible to define 
an area funetion which has even a minimal resemblance to the Euclidean area 
function. In the following theorem, it should be understood as usual that 6t is the 
set of all polygonal regions and that R, R;, and so on, denote polygonal regions. 


Theorem 1. Under HPP there does not exist a function 


such that CANTE 


(1) aR > 0 for every R; 
(2) if Rı and Rz intersect only in edges and vertices, then 


a(R, U R2) = aR; + aha; 


(3) if T, and T; are triangular regions with the same base and altitude, then 
aT, = aT». 
(Surely these are minimum requirements for an area function of the Euclidean 


type.) 
Suppose that there is such a function a. Then by (2) and (3), we have: 


(4) if Ry = Rz, then eR, = «Rs, because congruent triangles have the 
same bases and altitudes. 


FIGURE 24.72 


Consider now a right angle ZAPoP;, with APo = PoP; = 1. For each n, let 
P, be the point of PoP; such that PoP, = n. This gives a sequence of triangles 


AAPP, AAP,Ps, ..., 


344 HYPERBOLIC GEOMETRY 
and a corresponding sequence of triangular regions 
Ty, T», eves 


By (3), all the regions 7; have the same “area” aT. A, 
Now consider the corresponding defects 


For each n, [ 
di + dz + -+-+ dy = ôA APP, < 180. 


Since the finite sums d, + ds + +++ + d, are bounded, it follows that/the infinite 
series, / 
di t da d tdn te, 
is convergent. "Therefore / 
jim d, = 0. 


Hence 


d, < d; for some n. 


Figure 24.73 
By "Theorem 1, Section 24.10, there is a point B, between A and Po such that 
SABPoP, = &AAP, P, = dp. 
Therefore, by Bolyai’s theorem (Theorem 1, Section 24.8), the regions 7’, T', deter- 


mined by these triangles are equivalent by finite decomposition. By (4), this means 
that 


aT z af,. 
But 

aT, = aT, = A, 
Therefore 

aT = aT. 


But this is impossible, because aL ABP, > Oand 
aT, = aT + aAABP,. 
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24.13 THE UNIQUENESS OF HYPERBOLIC AREA THEORY 


Any reasonable area function æ should have the following properties: 


(1) aR > 0 for every R; 
(2) if R4 and R3 intersect only in edges and vertices, then 


a(R, U R2) = aR, + aR; 
(3) if Ry = Re, then aR; = aR». 


We know of one such function, namely ô, and it is plain that there are lots of 
others. If k is any positive real number, and 


aR = kik 


for every polygonal region R, then o satisfies (1), (2), and (3). On the other hand, 
this trivial way of getting an area function different from à is in fact the only way. 


Theorem 1. Let 
a:R5R 


be an area function satisfying (1), (2), and (3). Then there is a k > 0 such that 


aR = kik 
for every R. 
In the proof, it will surely be sufficient to find a k > 0 such that aR = kik 
whenever R is a triangular region; the general formula will then follow by addition. 
In fact, it will be sufficient to prove the following lemma: 


Lemma. For every two triangles A ABC and ADEF, we have 


aADEF _ 5ADEF. 
aAABC SAAB 


If this holds, then we have 
a^AABC | aADEF 
AABO SADEF’ 


and the desired k is the fraction on the left; for every ADEF, we have 3 
aADEF = kiADEF. 


Proof. For the case when 6A ABC = 5A DEF, we have AABC = ADEF by 
Bolyai’s theorem; by (3) it follows that 6A ABC = ADEF. Therefore the lemma 


holds. 
We may therefore assume that 


sA ABC > ADEF. 
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A 


=P, 
Boks P, P; P; P Pin 


Figure 24.74 


By Theorem 1, Section 24.10, there is a point P between A and C sucli that 
5^ABP = ôADEF. 
Since the defect is continuous, we can take a sequence 
B = Poy PiyesanPaxsa bg Cy 
of points, in the stated order on BC (in the figure above) such that 


SAAP;Pi41 = T SAABC 


for every i. Thus the segments AP; cut AABC into q triangles with the same 
defect. If we remember Bolyai’s theorem, and Conditions (2) and (3), we can easily 
see that each of the following conditions is equivalent to the next: 


(a) : < oe 
"ev: 

(e) S aA ABC < aA ABP, 
(d) A-P-P, 

(€) MA ABC < &AABP, 


3A ABP 
SA ABC’ 


SA DEF | 
3A ABC 


By the comparison theorem, the lemma follows. 


(f =< 


Rie. Rie. 


(g) - « 


24.14 ALTERNATIVE FORMS OF THE PARALLEL POSTULATE 


We observed in Section 24.1, as a consequence of the all-or-none theorem, that 
the following statement could be used as a substitute for the Euclidean parallel 
postulate, EPP. 
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(1) For some line and some point, parallels are unique. 

Many of the other theorems of this chapter give us such alternative forms of 
EPP. 

(2) The plane contains at least one rectangle. 

See Theorem 3, Section 24.3, which tells us that under HPP the plane contains 
no rectangles at all. 

(3) The plane contains at least one triangle for which the angle sum is 180. 

See Theorem 5, Section 24.3, which tells us that under HPP the angle-sum 
equality never holds. 

(4) The plane contains at least two triangles which are similar without being con- 
gruent. 

See Theorem 2, Section 24.2, which tells us that under HPP, similarity without 
congruence cannot occur even once. 

(5) There is an area-function 

a:R—>R, 

such that aR is always positive, a is additive for regions intersecting only in edges 
and vertices, and aA ABC depends only on the base and altitude of A ABC. 

See Theorem 1, Section 24.12, which says that under HPP there is no such 


function. 
The persuasiveness of these statements may make it easier to understand the 


state of mind of Saccheri and others. 


-— 


THE CONSISTENCY OF THE 
HYPERBOLIC POSTULATES 


25.1 INTRODUCTION 


In this chapter, we shall show that the Poincaré model, described in Chapter 
satisfies all the postulates of hyperbolie geometry. Our analysis of the model 
depend, of course, on Euclidean geometry, and so our consistency proof will be. 
conditional. At the end of the chapter we shall know not that the hyperbolie - 
postulates are consistent, but merely that they are as consistent as the Euclidean ji 
postulates. In the following chapter, we shall investigate the consistency of the 
Euclidean postulates. (See the discussion at the end of Chapter 9.) p 


25.2 INVERSIONS OF A PUNCTURED PLANE 


Given a point A of a Euclidean plane E and a circle C with center at A and | 
radius a. The set E — A is called a punctured plane. The inversion of E — A — 
about C is a function, 

J:E—A € E — A, 


defined in à the following way. For each point P of E — A, let P’ = f(P) be m F 
point of AP for which 


' AP’ = 


Figure 25.1 


(Thus, fora = 1, we have AP’ = 1/AP.) Since a?/a = a, we have the following ; 
theorems. » 
348 
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Theorem 1. If P € C, then f(P) = P. 


Theorem 2. If P is in the interior of C, then f(P) is in the exterior of C, and 
conversely. 


Theorem 3. For every P, fp) = P. 
That is, when we apply an inversion twice, this gets us back to wherever we 
started. 


Proof. f(P) is the point of AP for which Af(P) — a?/ AP, and j(f(P)) is the 
point of the same ray for which 


a? 


2 
Af(f(P)) = ARP = yar AP. 
Therefore f(f(P)) = P. 


Theorem 4. If L is a line through A, then f(L — A)=L—A. 


Here by f(L — A) we mean the set of all image points f(P), where P € L — A. 
In general, if 
KCE-A, 
then 
f(K) = (P' = DP e Kj. 


It is also easy to see that “if P is close to A, then P’ is far from A,” and con- 
versely; the reason is that “q2/AP is large when AP is small.” In studying less 
obvious properties of inversions, it will be convenient to use both rectangular and 
polar coordinates, taking the origin of each coordinate system at A. The advantage 
of polar coordinates is that they allow us to describe the inversion in the simple 


form 
f:E-—-AcoE— A, 


: (r, 0) e» (s, 9), 
where 
a? 
s zz — 
= 
and 
a? 


Papua 
In rectangular coordinates, we have 
P = (x,y) = (reos 6, r sin 6), 
f(P) = (u, v) = (scos 8, 8 sin 6), 
where r and s are related by the same equations as before. Evidently 


s? = u? + v? 
just as 
r = g? y. 
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These equations will enable us to tell what happens to lines and circles under 
inversions. We allow the cases in which the lines and circles contain the origin A ^ 
80 that they appear in E — A as “punctured lines" and “punctured circles,” Thus 
we shall be dealing with four types of figures, namely, lines and circles, punctured 
and unpunctured. For short, we shall refer to such figures as k-sets. The rest of 
this section will be devoted to the proof that if K is a k-set, then so also is f(K). Let 
us look first, however, at a particular case. 

Let K be the line x = a. 


Figure 25.2 


Then K is the graph of the polar equation 
T COS Ü = a, 


Since r — a?/s, where f(r, €) = (s, 6), it follows that S(K) is the graph of the 
condition } 


a? 
"y 008 9 = a, s0 

or 
= a cos 8, s*0 

or 


= ascos 0, s # 0. 
In rectangular form, this is 


u? + v? = au, u? + v? x 0. 


Replacing u and v by x and y (to match the labels on the axes), we see that f(K) 
is the graph of 
z? — ar +y? = 0, z? + y? * 0, | 


and is hence the punctured circle with center at (a/2, 0) and radius a/2. Thus Í 
has pulled the upper half of the line K onto the upper semicircle, and the lower half 
onto the lower semicircle, It is easy to see that points far from the z-axis (either 
above or below) go onto points near the origin. 
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More generally, we have the following theorem. 


Theorem 5. If K isa line in E — A, then f(K) is a punctured circle. 


Proof. Since we can choose the axes any way we want, we are free to assume 
that K is the graph of a rectangular equation 


xb 0, 
and hence of a polar equation 
rcos0 = b > 0. 


As before, setting r = a?/s, we conclude that f(K) is the graph of 


2 
= cos 8 = b, s #0, 


or 
a? 
s$? = = s cos 6, s #0, 
or 
2 a? 2 2 2 
ae ee u? +v £0, 
or 


2 a? ee 2 2 
ety 9 r? +y“ #0. 


Therefore f(K) is a punctured circle, with center at (a?/2b, 0) and radius a?/2b. 
Tt is easy to see that (1) every punctured circle is described by the above formula 

for some choice of b and some choice of the axes. "Therefore (2) every punctured 

circle L is = f(K) for some line K. But Theorem 3 tells us that f( j(P)) = P for 


every P. Therefore 
fü) = £00) = K. 
Thus we have the following theorem. 


Theorem 6. If Lis a punctured circle, then f(L) isa line in E — A. 


We now know, from Theorem 4, that under f, punctured lines go onto punctured 
lines; and we know, by Theorems 5 and 6, that lines go onto punctured circles and 
vice-versa. Now we must see what happens to circles. 


Theorem 7. If M isa circle in E — A, then f(M) is a circle in E — E S 
Proof. M is the graph of à rectangular equation 
z? + y? + Az + By+C — 0, 
where C æ 0 because the circle is not punetured. In polar form, this is 


r? + Ar cos 0 + Br sin 0 + C = 0. 
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Since r = a?/s, this tells us that f(M) is the graph of the equation 


4 2 2 

gt Aces 04. B-T sing+C = 0, 

or ^ 

a* + Aa*s cos 0 + Ba?s sin 0 + Cs? = 0, 
or t 

at + Aa?u + Ba%v + C(u? + v?) = 0. 


Replacing u and v by x and y, to match the labels on the axes, we get an equation 
for f(M) in the form 


gays, Eat 
ety t^t oW. tv- 
The graph f(M) is a circle; this circle is not punctured, because a?/C # 0. 


Theorem 8. If K is a k-set, then so also is KK). 


25.3 PRESERVATION OF THE CROSS RATIO UNDER INVERSIONS 


We recall, from Section 9.2, the definition of distance in the Poincaré model. 


R C 


FIGURE 25.3 


If T and U are points of the L-line with end points R, S on the boundary circle C, 
then the non-Euclidean distance is defined by the formula 


TR/TS 
*UR/US 
The fraction whose logarithm gets taken in this formula is called the cross ratio 
of the quadruplet R, S, 7, U, and is commonly denoted by (R, S, 7, U). Thus 


a(T, U) = 


log, 


TR: US 


(R, S, 1 U) = UR. TS’ 


and changing the notation slightly, we have 


EEI PP 
(P, Pa, Ps, P4) = PiP, P,P, 
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We shall show that inversions preserve the cross ratio. In the following theorem, 
f is an inversion of a punctured plane E — A about a circle with center at A and 
radius a, as in the preceding section. 


Theorem 1. If P = f(P) 6 — 1, 2, 3, 4), then 


(P3, Pa, Pa, P4) = (Pi, Ph, PS, P4). 


Proof. For each i from 1 to 4, let the polar coordinates of P; be (ri, 6). By the 
usual polar distance formula, we have 


P:P? = ri T r? — 2r;rj cos (6; — 6;). 


2 
Pt = (Si, 0) = (5. s): 


Now 


Therefore 
(Pu P5, Ps, Ps)” 
i+ rà — 2ryrg cos (01 — 63)lirà + rà — 2rar4 cos (02 — 04), 


ir? + rå — 2xira cos (81 — Balir + r3 — 2rers cos (02 — 83)) 


and 
(P^, P5, P5, P4)? 
4 4 4 4 4 4 
ae a Gai O a 
@ 1-2. — 9——co8 (01 — 93) ll- =. — 2— cos (62 — 0 
Fs = d FE (01 JE s ü 33 (05 o] 
wean gA at a4 as FT TA IO MEA 
ai at 5 9* eos (61 — 04) [> + M 2a 008 (02 — ? 
E i ri nn e pn 2 rå D (05 J 


To reduce the second of these fractions to the first, we multiply in both the numer- 
ator and denominator by s 


riririri 


at 


This theorem will tell us, in due course, that inversions applied to the Poincaré 
model are isometries, relative to the non-Euclidean distance. 


25.4 PRESERVATION OF ANGULAR MEASURE UNDER INVERSIONS 


A reexamination of Section 25.2 will indicate that the image of an angle, under 
an inversion, is never an angle. The point is that every angle in E — A has at 
least one side lying on à nonpunctured line, and the image of a nonpunctured line 
is always a punctured circle. Therefore the following theorem does not mean 
what it might seem to mean. 
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A Q' Q 
Figure 25.4 
Theorem 1. If A, P, and Q are noncollinear, P’ = f(P) and Q' = f(Q), then 
mZAPQ = mZAQ'P’. 


Proof. Consider APAQ and AQ'AP'. They have the angle ZA in common. 
Since 


glee a? 7 a? 
AP = Fp’ AQ = Xd 
we have 
AP- AP’ = AQ. AQ' = a?, 
so that 
AP _ AQ 
AQ APY 


By the SAS similarity theorem, 
APAQ ~ AQ'AP'. 


(Note the reversal of order of vertices here.) Since Z APQ and Z AQ'P' are cor- 
responding angles, they have the same measure. 


Figure 25.5 


In the figure above, P’ = f(P) and Q’ = f(Q), as before. Here we have 
u = 180 — a —r 
= (180 — r) — a 
=8— a. 
Therefore 


s—u=aą. 


The order of s and u depends on the order in which P and P' appear on the ray. 
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If P and P’ are interchanged, we should interchange s and u, getting 
w—s-a. 
Thus in general we have 


|s — ul = a. 


Consider next the situation illustrated in the figure below: 


Fieure 25.6 


Here B is the center of. a circular arc; PO is a line intersecting the arc at E; PŠ is 
a tangent ray at P; and RaP = a. We assert that 


lim mZ R,PQ = mZSPQ. 
a0 
lim, o mZ R,PS = 0.) 


(Proof? The first step is to show that 
nd point at a point Q. For small positive 


Consider now a circular are QS with e 


Figure 25.7 


numbers a, let Ra be the point of the arc for which 


QR, — a. 
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Let (/8 be the image of QS; that is x 
GP = N); 
let QR’ be the tangent ray at Q’. 
rt that 
m ZAQR = ZTQR. 


To see this, we observe that m< TQR, and m Z AQ'R; are the s and u that we dis- 
cussed just after Theorem 1. Therefore 


ImZ TQR, — mz AQ'Ri| = a. 


Now 
lim mZTQR, = mZTQR, 
2-0 
and 
lim mL AQ'Rz = mZAQ’R’. 
Therefore 


lim (m2 TQR. — mZ AQ Ri) = mZTQR — mZ AQ R. 


But the absolute value of the quantity indicated in square brackets is =a; 
a — 0 asa — 0. Therefore 


mZTQR = mZAQ' R’. 


Given two intersecting circles or lines, the tangent rays give us “tangent angi : 
like this: 


Figure 25.8 


By the preceding result, we have the following theorem. 


Theorem 2. Under sg corresponding tangent angles are congruent. 

That is, if AB and | AC are ares with a tangent angle of measure r, then their — 
images f(AB) and f(AC) have a tangent angle of measure r. Similarly for an arc 
and a segment or a segment and a segment. 


REFLECTIONS ACROSS LUNES IN THE POINCARÉ MODEL 357 


25.5 REFLECTIONS ACROSS L-LINES IN THE POINCARE MODEL 


We recall that the points in the Poincaré model are the points of the interior E 
of a circle C with center at P; the L-línes are (1) the intersection of E with lines 
through P and (2) the intersection of E with circles C' orthogonal to C, 


Fioure 25.9 


If L is an L-line of the first type, then the reflection of E across L is defined in 
the familiar fashion as a one-to-one correspondence, 


Ji BoE, 


such that for each point Q of E, Q and f(Q) are symmetric across L. 

If L is an L-line of the second type, then the reflection of E across L is the inver- 
sion of E about C’. To justify this definition, of course, we have to show that if f 
is an inversion about a circle C’ orthogonal to C, then f(E) = E. But this is not 
hard to show. In the next few theorems, it should be understood that f is an inver- 
sion about C"; C' has center at A, and intersects C orthogonally at R and S; and 
L=EnC. 


Froure 25.10 


Theorem 1. f(C) = C. 
Proof. f(C) is a circle. This circle contains R and S, because f(R) = R and 
f(S) = S. By Theorem 2 of the preceding section, f(C) and C' are orthogonal. But 
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there is only one circle C which crosses C" orthogonally at R and S. (Proof? Show 
that P must be the center of any such circle. Therefore f(C) — C, which was to 
be proved. 


Theorem 2. f(E) — E. 


Proof. Let X be any point of E. Then AX intersects C at points T and U. 
Since f(C) = C, we have U = f(T) and T = f(U). But inversions preserve 
betweenness on rays starting at A. Therefore f(TU) = TU, and f(X) € E. Thus 
K(E) c E. 

We need to show, conversely, that E C f(E). This is trivial: given that f(E) C E, 
we have f(J(E)) C f(E). Since f(f(E)) = E, this gives E C f(E). 


Theorem 3. If M is an L-line, then so also is f(M). 


Proof. M is the intersection E N D, where D is either a circle orthogonal to C or 
a line orthogonal to C. Now f(D) is orthogonal to C, and is a line or circle (punc- 
tured or unpunctured). Let D' be the corresponding complete line or circle. (Thus 
D' = f(D) or D' = f(D) U A.) Then 


KM) = f(D) NE 
es f =D NE, 
which is an L-line. 


We recall that an L-angle is the angle formed by two “rays” in the Poincaré 
model. 


Figure 25.11 


The measure of an L-angle is the measure of the angle formed by the tangent rays. 
We can now sum up nearly all of the preceding discussion in the following 
theorem. 


Theorem 4. Let f be a reflection of E across an L-line. Then, 


(1) fis a one-to-one correspondence E + E; 

(2) f preserves the non-Euclidean distances between points; 
(3) f preserves L-lines; 

(4) f preserves measures of L-angles. 


For L-lines of the first kind (passing through P) all this is trivial, because in this 
case f is an isometry in the Euclidean sense. It therefore preserves distances of 
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both kinds, lines, circles, orthogonality, and angular measure. For L-lines of the 
second kind, Conditions (1) through (4) follow from the theorems of this section 
and the preceding two sections. 


25.6 UNIQUENESS OF THE L-LINE THROUGH TWO POINTS 


Given the center P of C, and some other point Q of E. We know that P and Q 
lie on only one (straight) line in the Euclidean plane. Therefore P and Q lie on 
only one L-line of the first kind. But P does not lie on any L-line of the second 
kind. (The reason is that on the right triangle A A RP in the figure, the hypotenuse, 
AP, is the longest side.) It follows that the L-line through two points of E is 
unique, in the case where one of the points is P. 


e 


FicunE 25.12 


A 


To prove that uniqueness always holds, we need the following theorem. 


Theorem 1. For each point Q of E there is a reflection f such that f(Q) — P. 


Figure 25.13 


Proof. We start by the method of wishful thinking. If the inversion f about C" 
gives f(Q) = P, then 
a? > 
AQ 
We recall that the radius PR = 1. Let k = QP, and let x be the unknown distance 


AP = 
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AP (Fig. 25.13). Then the equation 


AP - AQ = a? 
takes the form 
z(r— k) = z? — 1 
or 


Ex or z=] 


Since Q is in Æ, we know that k < 1. Therefore z > 1, and A is outside C. If C" 
is the circle with center at A, orthogonal to C, then the reflection across E N C' 
is the one that we wanted. 

We can now prove the following theorem. 


Theorem 2. In the Poincaré model, every two points lie on exactly one L-line. 


Proof. Let Q and R be points of E. Let f be a reflection across an L-line such that 
f(Q) = P and f(R) = R’. We know that P and R’ lie on an L-line L. Therefore 
Q and R lie on the L-line f(L). If there were two L-lines L1, Lo, containing Q and 
R, then f(L,) and f(L2) would be different. L-lines containing P and R’, which is 
impossible. 

By Theorem 1, we can speak of the L-line containing Q and R. We shall denote 
this by QR; and to avoid confusion, we shall agree not to use this notation, in the 
rest of this chapter, to denote Euclidean lines. 


25.7 THE RULER POSTULATE, BETWEENNESS AND PLANE SEPARATION 


Our strategy in this chapter is to verify statements about L-lines, first for the 
easy ease of L-lines through P, and then to use inversions to show that the "curved" 
L-lines behave in the same way as the "straight" ones. In this spirit, we first check 
the ruler postulate for L-lines through P. 


Theorem 1. Every L-line through P has a coordinate system. 


| &cSE xU Go mim 
e ———————À—— 
R Q'" «E S 


Figure 25.14 


Proof. Suppose that L passes through P, and let its end points on C be and S. 
For each point Q of L, let 


KQ = 10g. GRO 


"og, be ^ 
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(Because PR = PS.) Let QS = x. Then 


QR = 2—QS = 2-2, 
and we have 
2—2 
TW 


J(Q) — log. 


Obviously f is a function L — R into the real numbers. We need to check whether 
f is a one-to-one correspondence L «+ R. Thus we need to show that every real 
number k is —f(Q) for exactly one point Q. Thus we want 


2—r 
k = loge; 

or 
E d 
x 

or 
(+ 1z = 2 

or 
Tag Sy, 
+l 


For every k there is exactly one such z, and 0 < x < 2, asit should be. Therefore 
every k is =f(Q) for exactly one point Q of L. 

We have already checked, in Chapter 9, that when the coordinate system f is 
defined in this way, the distance formula 


d(T, U) = |f(T) — f(U)| 
is always satisfied. 
Before proceeding to generalize Theorem 1, we observe that the formulas above 


give us some more information: 


X3 X; x, 0 

OO OF 

R Qı Q: Q; S 
Figure 25.15 


In the figure z; = Q,S for i = 1, 2, 3. It is easy to check that (2 — z)/z is a 
decreasing function. (Its derivative is —2/x? < 0.) And the logarithm is an in- 
creasing function. Therefore, if xı < 22 < 23, as in the figure, it follows that 


J(Qi) < f(Q2) < f(Qs), 


and conversely. We recall that betweenness is defined in terms of distance, and 
that one point of a line is between two others if and only if its coordinate is between 
their coordinates. Thus we have: 


Theorem 2. Let Qi, Q2, Qa be points of an L-line through P. Then Q;-Q2-Q5 
under the non-Euclidean distance if and only if Q;-Q2-Qs in the Euclidean plane. 
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Theorem 3. Every L-line has a coordinate system. 


Proof. Given an L-line L. If L contains P, there is nothing to prove. If not, 
let Q be any point of L; let g be a reflection such that g(Q) = P; let L’ = g(L), 
and let 

f:L'eR 
be a coordinate system for Z’. For each point T of L, let 


F(T) = f(g(T)). 


That is, the coordinate of T is the coordinate of the corresponding point g(T) of L’. 
Since f and g are one-to-one correspondences, so also is their composition f(g). 
Given points 7, U of L, we know that 


d(T, U) = d(g(T), g(U)), 
because inversions preserve the non-Euclidean distance. This in turn is 


= |f(g(T)) — f(g(U))], 
because f is a coordinate system for L’. Therefore 


d(T, U) = |f'(T) — f(U)|, 
which was to be proved. 


Theorem 4. Every L-line through P separates E into two sets Hı and H» such 
that (1) H, and H3 are convex, and (2) if Q € H, and R € Hz, then QR inter- 
sects L. 


Here QR means of course the non-Euclidean segment. 


Figure 25.16 


Proof. We know that the Euclidean line containing L separates the Euclidean 
plane into two half-planes Ht, Hj. Let H, and H, be the intersections H1 N E and 
H; N E, as indicated in the figure, 

Suppose that Q, R € H,, and suppose that QR intersects L in a point S. Letf 
be an inversion E +> E, about a circle with center A on the line containing L such 
thatf(S) = P. Then f(QR) is an L-line through P, and f(Q) and f(R) belong to H1- 
Since Q-S-R, we have f(Q)-P-f(R), in the non-Euclidean sense, because f preserves | 


| 
| 


RULER POSTULATE, BETWEENNESS AND PLANE SEPARATION 363 


the non-Euclidean distance. , Therefore f(Q)-P-f(R) in the Euclidean sense, which 
is impossible, because f(Q) and f(R) are in the same Euclidean half plane. 

It follows, in the same way, that H» is convex. Thus we have verified half of the 
plane separation postulate for the Poincaré model. 

Suppose now that Q € H, and R € Hz. Let C' be the Euclidean circle that con- 
tains the L-line QR: 


ELX 


Figure 25.17 


Then L contains a point S of the Euclidean segment from Q to R, and S is in the 
interior of C’. It follows that the Euclidean line containing L intersects C’ in two 
points, one of which is a point T of L. Now we must check whether Q-T- R in the 
non-Euclidean sense. [Hint: Use an inversion f:E €» E, Hı e Hi, Hz > Hz, 
T «€» P, and then apply Theorem 2.] 

To extend this result to L-lines in general, we observe that: 

Theorem 5. Reflections preserve betweenness. 


Because they preserve lines and distance. 


Theorem 6. Reflections preserve segments. 


Because they preserve betweenness. 


Theorem 7. Reflections preserve convexity. 


Because they preserve segments. 


Theorem 8. The plane separation postulate holds in the Poincaré model. 


Proof. Let L be any L-line, and let Q be any point of L. Let f be a reflection such 
that f(Q) = P; let I’ = f(L); and let H1 and Hz be the half-planes in E deter- 
mined by L/. Let 


Hı —[-^(Hi) and | Hy = f~'(H). 


Since f^! is also a reflection, and reflections preserve convexity, it follows that 
H, and H, are convex. This proves half of the plane separation postulate for L. 
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It remains to show that if R € H, and S € Ho, then RS intersects L. If R’ = f(R) 
and S' = f(S), then E € Hj and S' € H3, so that R’S’ intersects L’ at a point T". 
Therefore RS intersects L at T = f-'(T"). 


Theorem 9. Reflections preserve half planes. 


That is, if H, and H are the half planes determined by L, then f(H,) and f(H») 
are the half planes determined by f(L). Proof? 


Theorem 10. Reflections preserve interiors of angles. 


Proof. The interior of Z ABC is the intersection of (1) the side of AB that 


contains C, and (2) the side of BC that contains A. Since reflections preserve half 
planes, they preserve intersections of half planes. 


25.8 ANGULAR MEASURE IN THE POINCARE MODEL 


We have defined the measure of an (non-Euclidean) angle as the measure of the 
(Euclidean) angle formed by the two tangent rays. We need to check whether this 
measure function satisfies the postulates of Section 5.1. For angles with vertex 
at P this is obvious. To verify it for angles with vertex at some other point Q, we 
throw Q onto P by a reflection f. Now f preserves angles, angular measure, lines, 
and interiors of angles. It is therefore quite trivial to check that if Postulates 
M-1 through M-5 hold at P, then they hold at Q. 


25.9 THE SAS POSTULATE 


We have now verified, for the Poincaré model, all the postulates of absolute 
plane geometry, with the sole exception of SAS. With heavy use of inversions, this 
turns out not to be difficult. 


A A' 


B C B' [^4 
Ficure 25.18 
Given A ABC, A A'B'C', and a correspondence 
ABC + A'B'C' 
such that rs (>x * 
AB S A'B', BC = BC’, ZB = ZB’. 


(Here the segments and congruences are non-Euclidean.) We want to show that 
AABC = AA'B'C'. 
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P C, 
Figure 25.19 
First let fı be an inversion such that f(B) = P, and let AA,PC; be f(A A BC). 


(Note that PA and PC; look “straight,” as they should.) Since f; preserves both 
distances and angular measure, we have 


AA,PC, = AABC. 
Next let f? be an inversion such that fo(B’) = P, and let AB PC; = f(A4A'B'C"). 


Ficure 25.20 


It is rather easy to see that there is a reflection fs, across an L-line through P, such 
that fa(PC1) = PO, (If P, Ci, and C1 are not collinear, we reflect across the 
bisector of ZC{PC;, as indicated in the figure. It Ci-P-C 1, we reflect across the 
perpendicular to PC; at P. If we already have PC; = PC, we leave well enough 


alone.) 
Let AAZPC2 = f(AA{PC}). Since fg preserves distance and angular measure, 


bi ui AAPO, = AAYPC. 
And since BC = PC), and B'C' = PC; = PC3, we have C, = Ga. 


A\=A}? 


T C=C 


As? 
Figure 25.21 
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There are now only two possibilities: 

(1) A1ison the same side of PC; as Ay. In this case, since Z A3PC3 = Z Ai1PC;, 
we have PA} = PAj; and since PA, = PA}, we have A; = A3. Thus P = P, 
A, = Aand C, = C$. Therefore 


this fits together with our other congruences to give AABC = AA'B'C'. 

(2) Aj and A, are on opposite sides of PC. In this case we reflect across PO 
and then proceed as in case (1). 

Rather curiously, this proof has a great deal in common, intuitively, with 
Euclid’s “proof” of SAS, by superposition, in Book I of the Elements. You really 
can prove things by superposition if you carry out the process using a family of 
transformations (in this case, the reflections) which are known to preserve the 
properties that you are concerned with. 


RUE 


THE CONSISTENCY OF EUCLIDEAN 
GEOMETRY 


26.1 INTRODUCTION 


Our proof of the consistency of hyperbolie geometry, in the preceding chapter, 
was conditional. We showed that if there is a mathematical system satisfying the 
postulates for Euclidean geometry, then there is à system satisfying the postulates 
for hyperbolie geometry. We shall now investigate the if, by describing a model 
for the Euclidean postulates. Here again our consistency proof will be conditional. 
To set up our model, we shall need to assume that the real number system is given. 


Derinition 1. E = R X R. 
That is, a point is defined to be an ordered pair of real numbers. 


DzriNITION 2. A line is a set of the form 
L = ((z,y)Az + By + € = 0, A? + B? > 0). 


That is, a line is defined to be the graph of a linear equation in z and y. 


Deriniti0n 3. If P = (a1, y1) and Q = (x2, y2), then 


d(P, Q) = VŒ — 21? + (ys — yi)? 
That is, distance is defined by the distance formula which appeared as a theorem 


in Chapter 18. 

We define betweenness in terms of distance. (As usual, we abbreviate d(P, Q) 
as PQ.) Segments and rays are defined in terms of betweenness; and angles are 
defined when rays are known. 

It turns out that setting up an angular measure function is a rather formidable 
technical chore. We hope, therefore, that the reader will agree to settle for a con- 
gruence relation & for angles, satisfying the purely synthetic postulates C-6 
through C-9 of Chapter 8. This relation is defined in the following way. 


Derinirion 4. An isomelry is a one-to-one correspondence 
f:E E, 


preserving distance. 
367 
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Derinition 5. Two angles Z ABC and Z DEF are congruent if there is an 
isometry f : E © E such that f(Z ABC) = ZDEF. 


We have now given definitions, in the Cartesian model, for the terms used in 
the Euclidean postulates. Each of these postulates thus becomes a statement about 
a question of fact; and our task is to show that all of these statements are true. 


26.2 THE RULER POSTULATE 


By a vertical line we mean a line which is the graph of an equation z = a. It is 
easy to check that every nonvertical line is the graph of an equation y = mz + b. 


Theorem 1. Every vertical line L has a coordinate system. 
Proof. For each point P — (a, y) of L, let 
f(P) = y. 
Then f is a one-to-one correspondence L €» R. If P = (a, y1) and Q = (a, ys), 


then 
PQ = d(P,Q) = v (a — a)? + (ya — y)? 
= V2 — y)? 
= lya CY yıl 
= re P : 
as desired. IQ) — f(P)| 


Theorem 2. Every nonvertical line has a coordinate system. 


FIGURE 26.1 


Proof. Let L be the graph of y = mz + b. If (xı, y1) and (x2, y2) € L, then it 
is easy to check that 


=m, ya — yı = m(za — Tı), 


PQ = V (zs — z1)? F mi(z4 — 21)? = V1 + m?) [za — ail. 


and 


—O—t—Á———ÉRB 
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From this we see how to define a coordinate system for L. Let 


fiy) = eV + m. 
Then for 
P = (21,41), Q = (22, Y2) 


PQ = VI + m? |za — zıl 
= |ro/1 + m? — tıy1 -- m? | 
= IQ — f 


we have 


as it should be. 
These two theorems give us: 


Theorem 3. In the Cartesian model, the ruler postulate holds. 


26.3 INCIDENCE AND PARALLELISM 


Theorem 1. Every two points of the Cartesian model lie on a line. 


Proof. Given P = (x1, 41), = (22,92). If v1 = t2 then P and Q lie on the 
vertical line z = a = Tı. If not, then P and Q lie on the graph of the equation 


$us Ji 
22 Tes ve 


y— (æ — 2), 


which is easily seen to be à line. 


Theorem 2. Two lines intersect in at most one point. 


Proof. Given Li and Ls, with Li ~ La. If both are vertical, then they do not 
intersect at all. If one is vertical and the other is not, then the graphs of 


=a, y= mi +b 


intersect at the unique point (a, ma -+ b). Suppose finally, that Ly and Lz are the 


graphs of 
y = mz + bi, y = max + bs. 


If mı # ms, very elementary algebra gives us exactly one common solution and 
hence exactly one intersection point. If m, = me, then b, # be, and the graphs 


do not intersect at all. 
We have already observed that if L is the graph of y = mz + b, then for every 


two points (z1, y1) (2, Y2) of L, we have 


Y2 — Vi m. 
23 = 41 


Thus m is determined by the nonvertical line L. As usual, we call m the slope of L. 
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Theorem 3. Every vertical line intersects every nonvertical line. [At the point 
(a, ma + b). 
By easier algebra we get: 


Theorem 4. Two lines are parallel if and only if (1) both are vertical, or (2) 
neither is vertical, and they have the same slope. 


Proof. Given Lı # Lg. If both are vertical then L; || Lo. If neither is vertical, 
and they have the same slope, then the equations, 


y= meth, y= met be (b # bə), 


have no common solution, and L; || Lg. 

Suppose, conversely, that Lı | Ls. If both are vertical, then (1) holds. It 
remains only to show that if neither line is vertical, they have the same slope. 
Suppose not. Then 


Ly:y=mer+bi, Lery = mgx+ bo — (mi $ mə). 
We can now solve for x and y: 


0 = (mi — ma)z + (by — bə), 


dp bi — be 
m; — mo 

RM bi — bo 

ye m (Bt) +o 


We got this value of y by substituting in the equation of L;. But our x and y also 
satisfy the equation of Ly. This contradicts the hypothesis L, || Lo. 


Theorem 5. Given a point P = (x1, yı) and number m, there is exactly one line 
which passes through P and has slope =m. 


Proof. The lines L with slope m are the graphs of equations 
y = mz b. 


If L contains (x1, yı), then b = y; — mz, and conversely. Therefore our line 
exists and is unique. 


Theorem 6. In the Cartesian model, the Euclidean parallel postulate holds. 
Proof. Given a line L and a point P = (x, yi) not on L. 


(1) If L is the graph of x = a, then the line Z’: x = z; is the only vertical line 
through P, and, by Theorem 3, no nonvertical line is parallel to L. Thus the parallel 
L through P is unique. 

(2) If L is the graph of y = mz + b, then the only parallel to L through P is 
the line through P with slope =m. This is unique. 


TRANSLATIONS AND ROTATIONS — 371 
26.4 TRANSLATIONS AND ROTATIONS 


By a translation of the Cartesian model, we mean a one-to-one correspondence 
f:Eof, 
(my) e (d ay + b). 


Merely by substituting in the distance formula, and observing that a and b cancel 
out, we have: 


Theorem 1. Translations are isometries. 
If L is the graph of the equation 
Az -F By - C — 0, 
then the points (z^, y) = (zc @, Y + b) of f(L) satisfy the equation 
A(z’ — a) + BY’ — 5) +C = 0, 


or 


Aa! + By’ + (—aÀ — bB + C) = 0. 
This is linear. 


Theorem 2. Translations preserve lines. 


Since translations preserve lines and distance, they preserve everything defined 
in terms of lines and distance. 


Theorem 3. Translations preserve betweenness, segments, rays, angles, triangles, 

and angle congruences. 

Rotations are harder to describe, because at this stage we have no trigonometry 
at our disposal. Let us first try using trigonometry, wishfully, to find out what we 
ought to be doing, and then find a way to do something equivalent, using only the 
primitive apparatus that we now have at our disposal in our study of the Cartesian 


model. 


f(P)=(r cos (6+¢), r sin (0-6) 


Ficure 26.2 
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We want to rotate the Cartesian model through an angle of measure ¢ (Fig. 


26.2). Trigonometrically, this can be done by a one-to-one correspondence, 


f:Eok, 
defined as the labels in the figure suggest. 
Now 
cos (0 + $) = cos 6 cos $ — sin 6 sin ¢, 
sin (0 + $) = sin 6 cos $ + cos 0 sin ¢. 
Let 
a = COs $, b = sin ¢. 
Now 
r= vat dy, 
cos 6 = HIA , 
: y 
sin 0 = ————. 
Va? + y? 


We can therefore rewrite our formulas in the form 


f:@y e (^, y), 


where 
x’ = r cos (0 + 4) 
n D Uu 6 
= Vx + y2 ( [x2 F y? Vr? F y? 
= ax — by, 
and 


y x 
yf = vite Vere * vera) 
= ay + be. 


Any correspondence of this form, with a? + b? = 1, is called a rotation of the 
Cartesian model. 


Theorem 4. Rotations preserve distance. 
Proof. We have 
P= (zi, y), 
Q = (z2, y2), 
P= S(P) (az, p- byi, ayı d bz), 
Q' = f(Q) = (are — by», aya + bre). 
It is merely an exercise in patience to substitute in the distance formula, calculate 
P'Q', simplify with the aid of the equation a? + b? = 1, and observe that P'Q' = 
PQ. (The reader is warned that (P'Q^)? appears as a sum of twenty terms.) 


"———— | 
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Solving for z and y in terms of x’ and y', we get 
x = ax’ + by, y = ay’ — bx. 
Comparing the formulas, 
x’ = ax — by, y = br + ay, 
for f and corresponding formulas for f^, we see that these have the same form 
g-arz—WVy, y=oyt V'z', 
where a’ = aand b = —b. Therefore, we have the following theorems. 


Theorem 5. The inverse of a rotation is a rotation. 


Theorem 6. Rotations preserve lines. 
Proof. Listhe graph of an equation 
(1) z =k, 
(2) y = k, 
or 
(3) y= mr +k (m # 0). 
In Case (1), f(L) is the graph of 
ax’ + by = k, 


where a and b are not both =0, because a? + p? = 1. Therefore L is a line. 
In Case (2), f(L) is the graph of 
ay’ — bx! = k, 
which is a line. 
In Case (3), f(L) is the graph of 
ay — bx = max’ + mby’ + k, 


or 
(ma + b)z! + (mb — a)y’ + k = 0. 
If we had both 
ma +b — 0, mb — a = 0, 
then 
ma? + ab = 0, mb? — ab = 0, 


so that 
m(a? + b?) = 0, 


and m = 0, contradicting our hypothesis. 
As for translations, once we know that rotations preserve lines and distance, it 


follows that they preserve everything that is defined in terms of lines and distance. 
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"Therefore we have: | 
Theorem 7. Rotations preserve betweenness, segments, rays, angles, triangles, 
and angle congruences. 


We are going to use rotations in the Cartesian model in much the same way that 
we used reflections in the Poincaré model, to show that postulates for angle congru- 
ence hold. To do this, we shall need to know that every ray starting at the origin 
(0, 0) can be rotated onto the positive end of the z-axis, and vice versa. By Theo- 
rem 5, it will be sufficient to prove the following theorem. 


Theorem 8. Let P = (xo, 0) (x > 0), let Q = (zı, y1), and suppose that 
z3 = zi + yi. 


Then there is a rotation f such that f(P) = Q. 


FiaonE 26.3 | 


The equation in the hypothesis says, of course, that P and Q are equidistant from 
the origin. 


As a guide in setting up such a rotation, we note unofficially that we want to 
rotate E through an angle of measure ¢, where | 


T, 


a= cos ¢ = ——————; 
Vaid yi 
b= sing = — H. 
Vai + yi 


Thus the rotation ought to be 
f:Eok 
1G y) © (x', y), 


where 
z'-az—bm—À— £4. 4 
Vaid yi Vri +y? 
y = bz + ay = n —zr-4 = 


PT ec 
Vaid yi Vai yi 
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Obviously a? + b? = 1 in these equations, and so fisa rotation. And 


Ti yı 
(Lie SS 
Seo 0) (Vara Vai t+ yi ) 
= (z1, Y1) 


which is the result that we wanted. 


26.5 PLANE SEPARATION 


We shall show first that the plane-separation postulate holds for the case in 
which the given line is the z-axis. It will then be easy to get the general case. 
Let EÑ be the “upper half plane.” That is, 


Et = {(z, y)ly > 0}. 


Theorem 1. E* is convex. 


Proof. Lemma 1, Section 3.4, says that if A, B, and C are points of a line, with 
coordinates x, y, and z, and x < y < z, then A-B-C. (This was proved merely 
on the basis of the ruler postulate, and we can therefore apply it now.) Since only 
one of the points A, B, C is between the other two, the lemma has a true converse: 
if A-B-C, then z < y € zorz <% « y. 

Consider now two points, A= (x1, 41), C = (t3 y2) of Et; 


A 
sees 
A 
|i 
| 
| 


Fieure 26.4 


We need to show that AC lies in Et. That is, if A-B-C, with B = (t3, ya), then 
y > 0. Obviously, for the case z, * r9 We may assume that z, < x», as in the 
figure; and for the case tı = 72 we may assume that y; < yz. 

In the first case, the line AC is the graph of an equation 


y = mr + b, 
and has a coordinate system of the form 
f(x,y) = VI + m? z. 
In the second case, the line is the graph of the equation 


g= Tı 
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and has a coordinate system of the form 
S(t, y) = y. 
It is easy to check that in the first case 


f(A) < f(B) < f(C), 
so that 
3i € x3 < 2. 
For m > 0, 
mz, +b < mrs +b < mtz +b; 


for m < 0, the inequalities run the other way; but in either case y; lies betwe 
two positive numbers. In the second case (x; = 2x2), the same result follows ev 
more easily. 
Let E^ be the “lower half plane." That is, 
E~ = ((yly < 0}. 
Since the function, 
f : Ag; y) e (z, 3-1. 
is obviously an isometry, it preserves segments. Therefore it preserves conven u 
Since /(E*) = E^, we have the following theorem. 
Theorem 2. E is convex. 
It is an easy exercise in algebra to show that if A = (zi, yı) € Et, and B= 
(12, y2) € E^, then AB contains a point (x, 0) of the z-axis. Thus: 


"i 


Theorem 3. E and the line z = 0 satisfy the conditions for E and L in the plane | 
separation postulate. 


Now let L be any line in Æ, and let A = (x, y1) be any point of E. By a trans- | 
lation f, we can move A to the origin. By a rotation g, we can move the resulting 
line onto the x-axis. Let 


H=9g'f (0, Hi—g f (EB). 


Since all of the conditions of the plane separation postulate are preserved under | 
isometries, we have the following theorems. 


Theorem 4. E satisfies the conditions of the plane separation postulate. 


Theorem 5. Isometries preserve half planes. 


If f is an n isometry; then f(L) is a line L’. Tag: 


=f), H} = f(H3). 
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Then H, and H are convex, and every segment between two points f(A) of Hi 
and f(B) of H$ must intersect f(L). Therefore H{ is a half plane with L’ as edge. 
From Theorem 5 it follows that: 


Theorem 6. Isometries preserve interiors of angles. 
That is, if I is the interior of Z ABC, then f(J) is the interior of f(Z ABC). 


26.6 ANGLE CONGRUENCES 


We want to verify that angle congruence, defined by means of isometries of E 
onto itself, satisfies the postulates C-6 through C-9, Section 8.1, and also satisfies 
SAS. Only one of these verifications is trivial. 

C-6. For angles, congruence is an equivalence relation. 

Proof. (1) ZA £& ZA always, because the identity function E + E is an 
isometry. (2) If ZA = ZB, then ZB & ZA, because the inverse of an isometry 
isan isometry. (3) If ZA = ZB and ZB = ZC, then ZA & ZC, because the 
composition of the isometries for which ZA e ZB and ZB € ZC is always 
an isometry for which ZA <> ZC. 

The other verifications are more difficult. We begin with a lemma. 


Lemma 1. Let f be an isometry of E onto itself. If f(E*) = Et, and f(P) = P 
for every point P of the z-axis, then f is the identity. 


Proof. Let A be the origin (0, 0), and let B = (1,0). Let Q = (a,b) be any 
point, and let f(Q) = (c, d). Then 


AQ = f(A)FQ), BQ = (B) iQ). 
Taking the square of each of these distances, we get 
a? + b? = c? + d?, 
(à — D? += (c — 1? a, 


so that a = c. Therefore b? = d?. Since f(E*) = E*, b and d are both positive, 
both zero, or both negative. Therefore b — d. Thus f(Q) = Q for every Q, which 
was to be proved. 


Lemma 2. Let A be the origin; let B = (a, 0), (a > 0) be a point of the x-axis; 
and let C = (b, c) and D = (d, e) be points of E* and E” such that 


AC = AD, . BC = BD. 


Then there is an isometry 
f:EeE 


such that f(A) = A, f(B) = B,f(C) = D, and f(D) = C. 
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E 
C-(b, c) | 


D (d, e) 
Figure 26.5 


Proof. We shall show that d = b and e = —c. The desired isometry f will then 
be the function (x, y) €» (x, —y). 
Given 
b? + c? = d? + e, 
(b — a)? + c? = (d — a)? +4 e?, 


we have —2ab = —2ad. Since a > 0, this gives b = d. Therefore c? = e°. 
Since c > 0 and e < 0, we have e = —c. 


lemma 3. Given ZABC, there is an isometry f of E onto itself such that JÈ) = 

BÓ and f(BÓ) — BA. That is, the sides of the angle can be interchanged by an 

isometry. 

In the proof, we may suppose that BA = BC, since A and C can always be 
chosen so as to satisfy this condition. 


B G 
Figure 26.6 


Let D be the midpoint of AC. Using a translation followed by a rotation, we get 


an isometry g : E €» E such that e (BD) i is the positive end of the z-axis (Fig. 26.7). 
(First we translate B to the origin, and then we rotate.) By the preceding lemma 
there is an isometry h : E +> E, interchanging A’ and C', and leaving B’ and D’ 


fixed. Let 2 
J = hg. 


That is, f is the composition of g, h, and g~!. Then f is an isometry; f(B) = 
f(A) = C and f(C) = A. 
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Ficure 26.7 


It is now easy to verify the rest of our congruence postulates. Oddly enough, the 
easiest is SAS. We put this in the style of a restatement. 
SAS. Given A ABC, AA'B'C', and a correspondence 
ABC © A'B'C'. 
If (1) AB = A'B', (2) ZB = ZB’, and (3) BC = B'C', then (4) ZA = ZA’, 
(5) ZC = ZC’, and (6) AC = AC’. 


Fieure 26.8 


Proof. By hypothesis (2), there is an isometry E > E, ZB <> ZB’. By Lemma 3 
it follows that there is an isometry 
f:EoE 
:B e B' 
— ——À 
: BA e B'A' 
— -—> 
: BC  B'C'. 
(If the given isometry moves Z.B onto ZB’ in "the wrong way,” then we follow 
it by an isometry which interchanges the sides of ZB'.) From (1) it follows that 
A’ = f(A) and C’ = f(C). Therefore LA’ = f(ZA), and ZA’ = ZA; ZC’ = 
f(ZC), and ZC’ = ZC. Also AC = A'C', because f is an isometry. 
This proof bears a certain resemblance to Euclid’s “proof” of SAS by super- 
position. 
C-7. Let Z ABC be an angle, let B'C' be a ray, and let H be a half plane whose 


edge contains B'C'. Then there is exactly one ray B'A', with A’ in H, such that 
LABO = Z7 A'B'C'. 
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We give the proof merely in outline. It should be understood that all of the 
functions mentioned are isometries of E onto E, and that the ray R is the positive 
x-axis. 


(1) Take fı so that fı(B'C') = R. 

(2) Take fz so that fo(R) = R and fafi (H) = E*. (Of course, if fı(H) is 
already —E*, we let fo be the identity.) 

(3). Take g so that g;(BC) = R. 

(4) Take go so that go(R) = R and go(A) is in E*. 

(5) Let Lr = ja! fi gagi Z ABC). Then Z is the Z A'B'C' that we wanted. 

(6) Suppose that there are two rays B' A’, B'A” satisfying these conditions. 


Figure 26.9 


Then fof; moves the whole figure into the position shown in Fig. 26.9, where we 
supposedly have KL # KM but ZLKN = ZMKN. By Lemma 3, it follows 
^ : —À — —> — 
that there is an isometry f such that f(KN) = KN and f(KL) = KM. It follows 
that f(P) = P for each point P of the z-axis; since isometries preserve half planes, 
we have f(E*) = Et. By Lemma 1, it follows that f is the identity. This contra- 

dicts our hypothesis KL # KM. 
C-8. If (1) D is in the interior of ZBAC, (2) D' is in the interior of Z.B'A'C', 


(3) ZBAD = ZB'A'D', and (4) ZDAC = Z.D'A'C', then (5) ZBAC = 
Z B' A'C'. 


A E A C 
Figure 26.10 


Proof. (1) By an isometry f, we move AD onto R and B into E*. [For this we 
need a translation, followed by a rotation and perhaps a reflection (x, y) * (2, EB! 


ANGLE CONGRUENCES 381 


(2) By an isometry g, we move AD’ onto R on B' into E*. 
(3) By the uniqueness condition in the preceding postulate, we know that 
— ——3 — —> 
(AB) = g(A’B’) and f(AC) = g(A'C^. 
(4) Therefore f(ZBAC) = g(ZB' AC’). Therefore ZBAC = ZB'A'C'; the 
required isometry is guy: 
C-9. If (1) D is in the interior of ZBAC, (2) D' is in the interior of Z B'A'C', 
(3 ZBAD & ZB'A'D' and (4) ZBAC = ZB'A'C', then (5) ZDAC = 
ZZ DEAS 


A C A’ c 
Figure 26.11 


Proof. Let f be the isometry given by (4), so that f(ZBAC) = ZB'A'C'. By 
—. —3À —> —> 

Lemma (3) we may suppose that f(AB) = A'B' and f(AC) = A'C'. Then surely 

S(ZBAD) c ZBAD. The uniqueness condition in C-7 therefore tells us that ` 

f(AD) — A'D'. Therefore f(Z DAC) = ZD'A'C', and ZDAC = ZDAC, 

which was to be proved. 
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THE POSTULATIONAL METHOD 


27.1 INTRODUCTION 


In this chapter we give a general discussion of the ways in which sets of postulates 
are used in mathematics, with illustrations from preceding chapters. The general 
discussion has been postponed until now precisely so that the illustrations could 
be cited. The postulational method is used in a number of quite different ways, 
and the distinctions among these are rather difficult to explain in the abstract. 


27.2 POSTULATES CONSIDERED AS SELF-EVIDENT TRUTHS 


In the time of Euclid, and for over two thousand years thereafter, the postulates 
of geometry were thought of as self-evident truths about physical space; and 
geometry was thought of as a kind of purely deductive physics. Starting with the 
truths that were self-evident, geometers considered that they were deducing other 
and more obscure truths without the possibility of error. (Here, of course, we are 
not counting the casual errors of individuals, which in mathematics are nearly 
always corrected rather promptly.) This conception of the enterprise in which 
geometers were engaged appeared to rest on firmer and firmer ground as the cen- 
turies wore on. As the other sciences developed, it became plain that in their earlier 
stages they had fallen into fundamental errors. Meanwhile the “self-evident 
truths” of geometry continued to look like truths, and also continued to seem self- 
evident. 

With the development of hyperbolic geometry, however, this view became 
untenable. We then had two different, and mutually incompatible, systems of 
geometry. Each of them was mathematically self-consistent, and each of them was 
compatible with our observations of the physical world. From this point on, the 
whole discussion of the relation between geometry and physical space was carried 
on in quite different terms. We now think not of a unique, physically “true” 
geometry, but of a number of mathematical geometries, each of which may be a good 
or bad approximation of physical space, and each of which may be useful in various 
physical investigations. Thus we have lost our faith not only in the idea that simple 
and fundamental truths can be relied upon to be self-evident, but also in the idea 
that geometry is an aspect of physics. 

This philosophical revolution is reflected, oddly enough, in the differences be- 
tween the early passages of the Declaration of Independence and the Gettysburg 
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Address. Thomas Jefferson wrote: 

“We hold these truths to be self-evident, that all men are created equal, 
that they are endowed by their creator with certain unalienable rights, that among 
these are Life, Liberty and the pursuit of Happiness... .” 

The spirit of these remarks is Euclidean. From his postulates, Jefferson went 
on to deduce a nontrivial theorem, to the effect that the American colonies had the 
right to establish their independence by force of arms. 

Lincoln spoke in a quite different style: 

“Pourscore and seven years ago our fathers brought forth on this continent a 
new nation, conceived in liberty and dedicated to the proposition that all men are 
created equal.” 

Here Lincoln is referring to one of the propositions mentioned by Jefferson, but 
he is not claiming, as Jefferson did, that this proposition is self-evidently true, or 
even that it is true at all. He refers to it merely as a proposition to which a certain 
nation was dedicated. Thus, to Lincoln, this proposition is a description of a 
certain aspect of the United States (and, of course, an aspect of himself). (I am 
indebted for this observation to Lipman Bers.) 

This is not to say that Lincoln was a reader of Lobachevsky, Bolyai or Gauss, 
or that he was influenced, even at several removes, by people who were. It seems 
more likely that a shift in philosophy had been developing independently of the 
mathematicians, and that this helped to give mathematicians the courage to under- 
take non-Euclidean investigations and publish the results. 

At any rate, modern mathematicians use postulates in the spirit of Lincoln. 
The question whether the postulates are “true ” does not even arise. Sets of postu- 
lates are regarded merely as descriptions of mathematical structures. Their value 
consists in the fact that they are practical aids in the study of the mathematical 
structures that they describe. 


27.3 CATEGORIC POSTULATE SETS 


It sometimes happens that a postulate set gives a complete description of a 
mathematical structure, in the sense that any two structures that satisfy all of the 
postulates are essentially the same. Rather than attempting to give a definition 
of the phrase “essentially the same,” let us look at an example. 


Let 
[F, +, :] 
IF, aie af 


be two algebraic structures satisfying the field postulates of Section 1.3, and satis- 
fying the further postulate that each of the sets F and F' have exactly two elements. 
Let 0 be the element of F given by A-3, and let 0’ be the element of F’ given by 
A-3. Similarly, let 1 and 1’ be the elements of F and F' given by M-3. We can then 
set up a one-to-one correspondence 


[:9 0, 


:1 €» 1^ 


and 
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it is easy to check that f preserves both sums and products, both ways. This is 
what we mean when we say that [F,+, -] and [F’, +’, ”] are essentially the same. 
For algebraic structures, of course, this relation is called isomorphism. ‘Thus a set 
of algebraic postulates is categoric if any two algebraic structures that satisfy the 
postulates are isomorphic. 

Obviously the postulates for a field are not categoric, and neither are the postu- 
lates for an ordered field; they are satisfied by the rationals, by the surds, and by 
the real numbers, not to mention the non-Archimedean ordered field described in 
Chapter 28. 

Similarly, the postulates for synthetic plane geometry are not categoric. They 
are satisfied by a metric plane, in which the ruler postulate tells us that we have 
one-to-one coordinate functions 

f:LoR 


for every line. The same postulates are also satisfied in the surd plane, in which 
every line is everywhere full of holes. We can, however, get a categoric postulate 
set by adding two more postulates. 

We recall the metrization theorem of Section 20.6. Given a plane 


(E, £, =, 8] 


satisfying the synthetic postulates, and also satisfying the geometric form of the 
Archimedean postulate, we can always introduce a distance function 


d:EXE—R, 


which gives back to us the congruence and betweenness relations that we started 
with. (It would be worth the reader’s while to review Section 20.6, at this point, 
because we are going to make heavy use of it.) Under the distance function given 
by the metrization theorem, the ruler postulate is almost satisfied; we have “coor- 
dinate functions” 

f:LoR 


d(P,Q) = |f(P) — KQ), 


but we have no guarantee that all real numbers get used as coordinates. That is, 
the coordinate functions may be functions into R, and may fail to be one-to-one 
correspondences. If it happens that all coordinate systems on all lines are one-to-one 
correspondences, then we say that the plane that we started with is complete in the 
sense of Dedekind. 

This suggests a way to get a categorie set of postulates for synthetic plane 
geometry: we should add, to the usual postulates, the conditions: 


(1) E is Archimedean. 
(II) E is complete in the sense of Dedekind. 


Any two structures 


such that 


[E,£,&, e],  [B’, £', =’, 8) 
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which satisfy all of these conditions are essentially the same. Oddly enough, this 
is not very difficult to prove, once we have come this far in this book. 
By the methods of Chapter 18, we set up coordinate systems in E and E'. Thus 
we have correspondences 
EoRXR, P e (x,y) 
and 
E oRXR, P' e (xz, y). 


These really are one to one, because both of our planes are complete in the sense 
of Dedekind. For each point P of E, we let P' = f(P) be the point of 2’ which 
‘has the same coordinates (x, y) as P. Then 


(1) f preserves distance. 
That is, d(P, Q) = d'(P, Q^), for all points P, Q of E. 


Proof. Let P and Q have coordinates (x1, y1) and (r2, y2). Then P’ and Q’ have 
the same coordinates (x1, y1) and (22, Y2). Therefore 


dP, Q) = va — #1)? + G2 — y)? = dC", Q’). 
(2) f preserves lines. * 
That is, L is à line in E if and only if f(L) is a line in E". 
Proof. If Lis a line in E, then L is the graph of a linear equation 
Az + By+C = 0. 


It then follows that f(L) is the graph of the same linear equation. Therefore f(L) 
is a line in Æ’. The same proof works in reverse. 


(3) f preserves betweenness. 
That is, A-B-C in E if and only if A’-B’-C’ in F’. 


Proof. Each of the following statements is equivalent to the next: 

(a) A-B-C in E. 

(b) A, B, and C are collinear, and d(A, B) + d(B, C) = d(A, C). 

(c) A’, B', and C’ are collinear, and d’(A’, B') + d'(B’,C’) = d'(A', C^). 
(d) A'-B'-C' in E'. 


(4) f preserves segments, rays, and angles, because these are defined in terms of 
betweenness. 


(5) f preserves congruence between segments. 


Proof. Each 1 of the following statements is equivalent to the next: 
(a) AB = CD. 

(b) d(A, B) = d(C, D). 

(c) d/(A', B^) = d'(C', D^). 

(d) AB! = CD'. 
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(6) f preserves congruence between angles. 
That is, L ABC = ZDEF if and only if f(Z ABC) =" f( Z DEF). 


Proof. First we observe tnat 
f(Z ABC) = ZA'BIC’ 
and 
f(Z DEF) = ZD'E'F', 


by (4). We are free to choose D and F on the sides of Z DEF, so that DE = AB 
and EF = BC. By SAS it follows that AC = DF. Therefore 


D'E' = A'B', E'F' = Bie, AC = D'F' 


because f preserves distance. (Here we are using the short notation for distance in 
both of our planes.) By SSS it follows that 


AA'B'C' = AD'E'F', 
Z A'B'C' = ZD'E'F', 


and 


which was to be proved. 

Thus f preserves all the structure mentioned in our postulates. This means that 
our postulates for synthetie plane geometry became categorie once we had added 
(1) the Archimedean postulate and (2) the Dedekind postulate. 

A categorie postulate set is a sort of arch of triumph. When we are able to write 
such a postulate set for a particular mathematical structure, this means that we 
have a complete understanding of its essential properties. Note, for example, in the 
ease of synthetic plane geometry, that we did not know what conditions to add to 
make our postulates categorie, until we had gone through the rather deep and 
difficult discussion in Chapter 20. 


27.4 THE USE OF POSTULATE SETS AS CODIFICATIONS 


In fact, categoric postulate sets are rather rare. Most of the time, when we write 
down a set of postulates, we do so not to get a complete description of a particular 
mathematical system, but for precisely the opposite purpose. Most of the time, 
the value of the postulates lies in their generality: they describe a common aspect 
of various mathematical systems which may have little else in common. 

One striking example of this is the idea of a group. (A group, of course, is & 
pair [F, -] satisfying M-1 through M-4 of Section 1.3.) Once we have proved à 
theorem about groups in general, on the basis of these four postulates, we are free 
to apply the theorem in an immense variety of contexts. This process gives an 
efficient codification of mathematics; it spares us the job of repeating essentially 
the same proof over and over. (Often it leads us to simpler proofs, because it tends 
to protect us from being distracted by irrelevancies.) 

To some extent, postulates have been used in this way in this book. For example, 
the postulates of metric absolute plane geometry are not categorie; they allow the 
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possibility that the plane is either Euclidean or hyperbolic. In Chapters 1 through 
7 and 10, we used neither of the two possible parallel postulates. We were then free 
to use the resulting theorems both in Euclidean geometry (Chapters 11 through 23, 
and 25) and in hyperbolic geometry (Chapter 24). The resulting economy was 
considerable. If we had'introduced the Euclidean parallel postulate earlier, it 
would have become necessary to do some of our work all over again in Chapter 
24. Suppose, for example, that we had postponed the study of geometric inequal- 
ities until after Chapter 11, and had proved the exterior angle theorem in the 
way suggested by the following figure: 


A 
uw 
B Cc D 
r+s+t=180; ft+u=180; u=r+s; 
u>r, u>s. 
FIGURE 27.1 


It would have become necessary to develop the theory of geometric inequalities 
all over again in Chapter 24. 

Similarly, we used the postulates for a Euclidean ordered field in Chapters 1, 3, 
6, and Chapters 10 through 18. These postulates allow both the complete real 
number system and the surd field, and therefore our results could be used in both 
Chapters 19 and 20. In Chapter 20 we introduced the Dedekind postulate, at the 
point where we really needed it. Thus, at most points in this book, the usefulness 
of our postulate sets has been due to the fact that they were not categoric. 


27.5 THE USE OF POSTULATES AS A MATTER OF CONVENIENCE 


Often we use postulates when they aren’t logically necessary at all. For example, 
we showed in Chapter 14 that all the area postulates of Chapter 13 were superfluous 
because we could prove that in any metric geometry there has to be an area func- 
tion satisfying these area postulates. The postulates for volume, in Chapter 23, 
are equally superfluous, for the same sort of reason, although we have not proved 
the fact in this book. 

In each of these cases, we introduced new postulates merely to avoid (or to post- 
pone) difficult proofs. 


diu 20 


AN EXAMPLE OF AN ORDERED FIELD 
WHICH IS NOT ARCHIMEDEAN 


In Chapter 1 we postulated that the real number system had the Archimedean 
property. In fact, this postulate was necessary; it is not true that every ordered 
field satisfies the Archimedean postulate. Here we give an example of such a 
peculiar field. 

First let P be the set of all polynomials with real coefficients. Thus P includes 
the “zero polynomial,” which is =0 for every x, the constant polynomials f(x) = ao; 
and the polynomials of degree n > 0, of the form 


f(x) = ane” + a, 2^7 + +++ + aix - ao, 


where a, Æ 0. In any case, a, is called the leading coefficient. Thus the leading 
coefficient of f(z) = 2x? — 3x + 4 is 2, and the leading coefficient of a constant 
polynomial f(x) = ao is the same constant a. 

For the purposes of this chapter, a polynomial will be called positive if it takes 
on only positive values, when z is sufficiently large. To be more precise, f is posi- 
tive if there is a number k such that 


f(x) > 0 for every x > k. 
For example, f(x) = z? — 2 is positive, because 
z?—2»0  forevery x > V2. 


We are using k = 4/2. Similarly, f(z) = z* + x? + 1 is positive; here any num- 
ber at all can be used as k, because z* + z? + 1 > 0 for every x. 
Theorem 1. f > 0 if and only if the leading coefficient in f is 70. 
Proof. For constant polynomials, this is obvious. Suppose, then, that 
f(a) = ant” + a4 4277! + -+ aix + ao, 
with a, = 0. Then 


fe) = oat [t+ Sat Sek EE 
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Here the first factor is always >0 when x > 0. And the expression in the brackets 
is >0 whenever 


üs—1 05—2 ao 
ad eee T 
Surely this holds when z is greater than a certain k, because as x — oc, each of 
the terms on the left 0. (For a complete proof of this, see a calculus book.) 

On the other hand, if an < 0, then anx” < 0 when x > 0, and the bracket is 
still >0 when x is greater than a certain k. 

Thus the positive polynomials are simply the ones that have positive numbers 
as their leading coefficients. 

A polynomial f is called negative if there is a number k such that 


f(x) < 0 for every x > k. 


From Theorem 1 it follows immediately that: 


Theorem 2. Every polynomial (other than the zero polynomial) is either posi- 

tive or negative. If f > 0, then —f < 0, and conversely. 

The algebraic system formed by our set P of polynomials has nearly all the 
properties that we are looking for. It has a 0, namely, the polynomial f = 0 which 
is =O for every x. It has a 1, namely, the polynomial g = 1. Addition and multi- 
plication are associative, commutative, and distributive, because the real numbers 
have these properties. We easily define an order relation, by defining 


f<g 
g—-f>0. 


to mean that 


Thus f < g if f(z) < g(x) for every x greater than a certain k. It can also be 
checked that Conditions O-1, 0-2, AO-1, and MO-1 hold. 'Thus P forms an ordered 
commutative ring with unity: it satisfies all of the postulates for an ordered field, 
with the sole exception of the postulate which says that every f ~ 0 has a re- 
ciprocal. 

(We omit the details.of these verifications, but will give the analogous details 
presently, for the ordered field which we shall finally be interested in.) 

On the other hand, P is surely not Archimedean. Take, for example 


e=f, f(z) =1  forevery z, 
and 
M=4q, ga) = x for every x. 


No matter what the integer n may be, we have 


ner when Sn. 
Therefore 
ne < M 


for every n, and our e and M do not satisfy the Archimedean postulate. We may 
say that f(x) = zr is "infinitely large compared with" g(x) = 1. 
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In the same way, z? > nz when r is large enough. Therefore, if e(x) = a 
M(z) = z?, then E 


ne « M forevery m. 


Thus there is an ordered ring which is not Archimedean. To get a non-Archimedea 
ordered field, we use the simple device of forming quotients of polynomials. E 
By a rational function we mean a function r of the form 


where f and g are polynomials and g # 0. A rational function r is called positi 
if there is a number k such that 


r(x) > 0 = forevery z > k. 


k such that 
r(z) <0  forevery x >k. 


Theorem 3. Every rational function (other than 0) is either >0 or <0. 
Proof. Given 


where f is not the zero polynomial. We can surely choose f and g so that g > 0. 
(If this does not already hold, we multiply in numerator and denominator by —1.). 
Thus we have a kı such that 


g(t) > 0  forevery x > kı. 
Suppose now that f > 0. Then there is a ka such that 
f(z) 0  1Éorevery x > ks. 


Let k be the larger of the numbers k; and ks. If z > k, then we have x > kı am 
x > kə. Therefore f(x) > 0 and g(x) > 0. Hence 


tæ) = m 20 for every «> k. 


Similarly, if f < 0 and g > 0 it turns out that r < 0. 


Theorem 4. Ifr > Oands > 0, then r + s > Oandrs > 0. 
Proof. Given 


r(z) > 0 forevery x > ky 
s(x) >0  forevery x > kz. 
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Let k be the larger of the numbers k; and kg. For every x > k, we then have 


r(x) + s(x) > 0, 
and 
r(z)s(x) > 0. 


Therefore r + s > 0 and rs > 0, which was to be proved. 
It is easy, of course, for two rational functions to be essentially the same. For 


example, consider 


E a(x — 2) 
ne) i — DS — 29. 
a(x — 3) 


nG-u-1)6:—393 


The first of these is defined except at x = 1 and z = 2; and the second is defined 
except at x = land x = 3. Wherever both functions are defined, they have the 
same value. Two rational functions which are related in this way are called equiv- 
alent. More precisely, 
Tg 6v/T2a 
if 
ri(z) = ra(z), 

except perhaps at a finite number of points. The following theorem is easy to see. 

Theorem 5. If rı ~ rs and sı ~ 85, then riT2 ~ 8132 and rı + re ~ 81 + 82. 

If rı ~ rg and rı > 0, then r2 > 0. 

The set of all rational functions equivalent to r is denoted by 7. Thus 


z= {ss~ r}. 
By the preceding theorem, we can state the following definitions. 


(1) F > Oifr > 0. 
2)7?--s—r-Fs 
(3) 7.8 = rs. 


The point is that positiveness, sums, and products depend only on the equiva- 
lence élasses 7, 3, and do not depend on their representatives r and s. From Theo- 
rem 4 we now get: 


Theorem 6. If 7 > O0 ands > 0, then? + 8 > O and rs > 0. 


Now let F be the set of all equivalence classes F. We assert that F forms a field. 
The associative, commutative, and distributive laws hold in F, because they 
hold for real numbers. For example, 


(rs) = r(st), 
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because for all but a finite number of real numbers z, we have 


[r(x)s(x)]e(z) = r(x)[s(x)ét(x)]. 
Therefore 
(rs)t = r(st). 
Hence 


(ra)t = r(st), 


by definition of multiplication for 7, 8, î. Similarly for the other postulates which 
merely state algebraic identities. 
The set F contains a 0 and a 1, namely 0 and T. Also —7 = =? and 


1_ (1), 
TON 
Obviously F is closed under addition and multiplication, because sums and products 
of rational functions are always rational functions. 
We must now define an order relation in F, and show that F forms an ordered 
field. 
Given 7, 3, F. If 
3—F>0, 
then (by definition), 
P 


For every F and 8, we have exactly one of the conditions 
af. >.0; 8—7-—0, SE x. 
"Therefore we have exactly one of the conditions 
F<; T—3 S«? 
Thus our relation < satisfies O-1. If 


Va oe). on E 
then 


3—F>0O and i—3»0. 


By Theorem 6, we have 


(8—7)--( — 3) » 0, 
so that 
i—F>0, 
and 
rn 


Therefore the relation « satisfies O-2. 


— —— 
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It remains to verify AO-1 and MO-1. 


Given 
P8 
we want to conclude that 
F+i<s+i. 
The first condition says that 
3—F>0. 


The second condition says that 
(84-2) —(F+8>0. 


Obviously these are equivalent. Therefore AO-1 holds; we already know from 
Theorem 6 that MO-1 holds. Therefore F forms an ordered field. 

But F is not Archimedean. To show this, we proceed in exactly the same way 
that we did for P. Let e be the function which is =1 for every x, and let M be 
the function which is =a for every x. For every positive integer n, ne(z) = n 
for every x. Therefore, for every n we have 


M(x) — ne(z) = ra(z) = x — n. 


Now x — > 0 when x > n. Therefore r, is a positive function for every m. 
Therefore, for every n, we have i 
M — m > 0. 


Therefore 
nz < M 


for every n. Thus F is not an Archimedean ordered field; 2 is so exceedingly small, 
compared with M, that no integral multiple of 2 is >M. 
In the same way, if we let 
Tax) = 2", 


it is rather easy to see that the equivalence classes 71, 72, . . . are not related in an 
Archimedean fashion in F. In fact, every integral multiple n7; of 7, is «T5; we 
always have n7? < T3, and so on. 

The above is the simplest example of the sort that the author knows of; but it 
must be confessed that most simple does not mean very simple. 
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THE THEORY OF NUMBERS 


We recall, from Chapter 1, that the set N of positive integers is defined by the 
following three conditions. 


(1) N contains 1. 
(2) N is closed under the operation of adding 1. 
(3) N is the intersection of all sets of numbers satisfying (1) and (2). 


On the basis of this definition of N, we immediately got the following: 


Induction Principle. Let S be a set of numbers. If (1) S contains 1, and (2) S is 
closed under the operation of adding 1, then (3) S contains all of the positive 
integers. 


We then defined the set Z of integers as the set whose elements are the positive 
integers, their negatives, and 0, and we showed that Z formed a commutative ring 
with unity. 

We shall now investigate the divisibility and factorization properties of the 
integers. Our first step is to observe that we can always divide one positive integer 
by another, getting a quotient g and a remainder r. More precisely, without men- 
tioning division, we can state the following theorem. 


Theorem 1. Let n and a be positive integers. Then n can be expressed in the 
form 

n — aq +r, 
where 

0 d nee Gi 


The easiest formal proof is by induction. Take a as fixed, and let § be the set of 
all positive integers n which can be expressed in the desired form. Then S contains 
1, because 

=a-:0+1 (a > 1) 
or 
121:14-0.- (a2 1). 


And S is closed under the operation of adding 1. Given 


n — aq +r, 0src«a, 
we have 
n T 1 = aq+ (r+ 1) 
394 
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which is what we wanted, unless r = a — 1. Ifr = a — 1, then 


n+1l=aq+a 
= alq + 1), 
which has the desired form. 
An integer d # 0 is a divisor of an integer a if a = dq for some integer g; that is, 
d divides a if a/d is an integer. If d divides both a and b, then d is a common divisor 
of aand b. If it is also true that every common divisor of a and b is also a divisor 
of d, then d is a greatest common divisor of a and b, and we write 


d = gcd (a, b). 


Note that this definition does not say merely that (1) d divides both a and b, 
and (2) d is the largest number that divides both a and b. When we say that 
d = ged (a, b), we mean that d is “largest” in the sense of divisibility; that is, 
every common divisor of a and b must not only be <d but must also be a divisor 
of d. Thus, while it is plain that no pair of numbers a, b can have more than one 
ged, it is not plain that a and b have any gcd’s at all; and so the following theorem 
is not trivial. 


Theorem 2. Any two positive integers have a greatest common divisor. 


Proof. Let a and b be the two numbers. Consider the set D of all positive integers 
that can be written in the form 
Ma + Nb, 


where M and N are any integers, positive, negative, or zero. Obviously D is not 
empty, because the positive integer a can be written as 


1:a-- 0-b. 
By the well-ordering principle, D has a least element d. We shall prove that 
d = gcd (a, b). 
(1) d divides b. Suppose that d does not divide b. Then 


b — dq r, Osr<d. 
We know that 
Ma+Nb=d, 
for some integers M, N, and 
r=b— dq. 
Therefore 
r= b — (Ma + Nb)q 


= (—Mq)a + (1 — Nq)b. 
Therefore r € D. This is impossible, because 0 S r < d, and d was supposed to 
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be the least positive element of D. Therefore d divides b. In exactly the same way 
we get 


(2) d divides a. 


Therefore d is a common divisor of a and b. If e is any other common divisor of 
a and b, then 
d = Mpe + Nge = (Mp + Ngye, 


so that e divides d. 
Note that in the proof of Theorem 2, we had d € D. This gives us the following 
theorem. 


Theorem 3. Let a and b be any positive integers. Then there are integers M, 
N such that 
Ma + Nb = ged (a, b). 


We can now verify a theorem that we all have been taking for granted for years. 


Theorem 4. Every rational number can be expressed as a fraction in lowest 
terms. That is, given a rational number p/q, there are always integers r and s 
such that 


B ut 
qi UR 
and 
ged (r,s) = 1. 
To prove this, we let 
= ged (p, 4), 


so that 
p= rd, q — sd 


for some positive integers r and s. Now r and s are the integers that we wanted. 
Obviously 


PL. 
dU aM. Los 
And since 
Mp-4- Nq—d 
for some M, N, we have 
Mrd + Nsd = d, 
and 
Mr + Ns = 1, 
so that 
ged (r,s) = 1. 


A prime number is a positive integer p > 1 whose only positive divisors are itself 
and 1. 
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Theorem 5. If n divides ab, and ged (n, a) = 1, then n divides b. 
(Here n, a, and b are positive integers.) 
Proof. There are integers M and N such that 
Mn + Na — 1. 
"Therefore 


Mnb + Nab = b. 


Since n divides both Mnb and Nab, it divides their sum. 


Theorem 6. Let a and b be positive integers, and let p be a prime. If p divides 
ab, then either p divides a or p divides b. 


Proof. We need to show that if p does not divide a, then p divides b. If p does not 
divide a, then ged (p, a) = 1, because p is a prime. By the preceding theorem, 
p divides b. 

The above is all of the number theory that we shall really need for the purposes 
of this book. Once we have gotten this far, however, we may as well prove the 
unique factorization theorem. 


Theorem 7. Every natural number greater than one can be expressed as a product 
of primes. 
Here repeated factors are allowed. For example, 
12 = 2-3-2 
is a product of primes. 


Proof. Suppose that the theorem is false. Then some number is not a product 
of primes. Let n be the smallest such number. Then n is not a prime. Therefore n 
has some divisor a, different from n and from 1. Thus 


n = ab, lean, 1<b<n. 


Since n was the smallest number for which the theorem fails, a and b are products 
of primes. Therefore n is also a product of primes, and this contradicts our hy- 
pothesis. 

It follows that every natural number n can be expressed in the form 


n = pip? pk, 


where the p;'s are different primes, and the a;'s are natural numbers. Such a fac- 
torization is in standard form if the p;'s are written in order of magnitude, that is, if 


Pi < P2 €: € pe 


Theorem 8. The factorization of a natural number into primes is unique, except 
for the order of the factors. 
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Proof. Suppose that 


n = pp? pë = qid? qj 


induction hypothesis, that every number less than n has only one prime factoriza- 
tion in standard form. We shall show, on this basis, that k = j, p; = q; for each 
i, and a; = bj for each i. 

Every prime factor p of n divides one of the factors pft. Therefore p divides 
some p;. Therefore p is some p;. Therefore p, is the smallest prime factor of n. 
For the same reason, qı is the smallest prime factor of n. Therefore p, = qi. 
"Therefore 

a,;—1, a by—1 


n 
— = pt pr + pee = gi 


ba... gi. 
pi 25 ud 


But n/p; € n. Therefore n/p; has only one factorization in standard form. 
Therefore k = j, p; = qi for each i, and a; = b; for each i. Therefore our “two” 
factorizations of n were the same all along, which was to be proved. 


PROBLEM SET 


where both of the indicated factorizations are in standard form. Suppose, as an 
1. Given two natural numbers rj, r2. Let qı and rg be such that 
ri = tag + ra | OS rs < ra. 
(That is, divide r2 into r1, to get a quotient and a remainder.) Let g2 and r4 be such that 
T2 = Tsq2-F TA (0 S r4 < r3). 


Proceed in this way until you get a last positive remainder ra: 


Ti = regi + 73 
T2 = rsq2-+ T4 


Ti = rindi + i42 


Ta—2 = Ta—1Qn—2 + Tn 
Ta-1 = TrQn—1- 


Of course, the process must terminate, because the r;'s are all positive, and they form a 
decreasing sequence. 
Show that 


Ta = ged (ri, r2). 
2. Show that if n is a natural number, then 4/n is either a natural number or an irra- 
tional number. 


3. Theorem 2 tells us that certain integers M and N must exist, but it gives us no help 
in finding such a pair of numbers. Describe a scheme for finding them. 
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4. Find integers M and N, so that 
41M + 31N = 1. 


5. In the proof of Theorem 8, we appealed tacitly to the following theorem. 


Theorem. If a and b are positive integers, and a > 1, then b < ab. Prove this. 


6. Justify the following statement from Problem 1: “Of course, the process must termi- 
nate, because the r;'s are all positive, and they form a decreasing sequence.” (Remember 


the well-ordering principle.) 


En. OU 


THE THEORY OF EQUATIONS 


For the minimum purposes of this book, it would suffice to discuss the theory 
of equations only for cases where all of the roots of our equations are real. But 
this would be hopelessly artificial and somewhat misleading. Throughout this 
chapter, therefore, when we speak of numbers, we allow the possibility that the 
numbers are complex, unless the contrary is explicitly stated. The set of all com- 
plex numbers is denoted by C. We assume that C forms a field. 

As usual, a polynomial of degree n is a function 


{:c->C 
of the form 
f(z) = anz” N ER 71 ere E a? + ao. 


For n 2 1, we require that a, ~ 0. Thus the zero polynomial is allowed, as a 
polynomial of degree 0, but we don’t call 0 - 2? + z + 1 a polynomial of degree 2. 
Note that we are allowing the coefficients a; to be complex. 


Theorem 1. Let f be a polynomial of degree n = 1, and let zy be any number. 
Then f can be expressed in the form 

S(2) = a(z)(« — zo) + r, 
where q is a polynomial of degree n — 1 andr € C. 


For reasons which will soon be plain, it is imperative to avoid mentioning divi- 
sion in stating this theorem. The proof is by induction. Let S be the set of all 
positive integers n for which it is true that every polynomial of degree n has the 
desired property. Then S contains 1, because 


az + ag = a(z — zo) + (aizo + ac). 
Here q(2) = a; for every z, and r = aizo + ao. 
IíneS,thenn--1eS. 
Proof. Given 


Ín4i(2) = a5 4127*! + az" + +++ + az + o. 
Then 


Sn4i(2) = Gn412"(2 — 29) + (an412o + a4)2" - +++ + aiz + ao 


= Qs 4.12" (2 — 29) + faz). 
400 
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Here f, is a polynomial of degree Sn. Therefore 
fhe) = a(2)(@ — 20) +r, 


and 
fay) = lang” + a) — 20) +r, 


which has the desired form. Therefore n + 1 € S, and S contains all the positive 
integers. : 
In this theorem, when r — 0, we have 


f() = «(Gs — 20): 
Here we say that z — zo divides f (z). In the equation 


f) = «(e — 20) t ^» 
we set z = zo. This gives 
f(zo) = r. 


Thus we have: 
Theorem 2. The Remainder Theorem. If 
f) = ae — 20) +r, 
feo) = r. 


then 


Theorem 3. The Factor Theorem. If zo is a root of the equation, 
f(z) = ane” + an—12" 71 + +++ + az + ao = O, 
then z — zo divides f(x), and conversely. 
Note that we could not have proved Theorems 2 and 3 by first writing 


T 
Z2 — 20 


Jones 
"Uma ga q(z) + , 
and then multiplying by z — zo to get the equation that we really want. The 
division is valid only if z — zo »€ 0, and z = žo happens to be the very value that 
we are interested in. 

An algebraic equation of degree n (n = 1) is an equation of the form 


anz” + a2 5 + +++ + ait ao = O, 


where all of the coefficients a; are integers, and where a, # 0. 

The fundamental theorem of algebra, due to C. F. Gauss, asserts that every 
algebraic equation has at least one root in the field of complex numbers. Of course 
there may not be any roots in the field of real numbers. Moreover, the only roots 
that are easy to find, for n > 2, are the rational ones. The method of finding 
these is based on the following theorem. 


402 THE THEORY OF EQUATIONS 


Theorem 4. Let x = p/q, in lowest terms, be a root of the equation 
Ont” + a4 4277! + +++ + az H ag = O. 


Then p divides ap and q divides an. 


Proof. We have 
=I 
anp” 5 anp ae 
= 0. 
q^ F q^-1 Senn 7+ — + ao 
Therefore 
anp” + anp” 1g + +++ + ap! + agg" = 0. 


We know that ged (p, q) = 1. Hence p and q have no prime factor in common. 
Thus q and p" have no prime factor in common. Therefore ged (p^, q) = 

Since q divides every term in the equation after the first term anp”, it follows 
that q divides anp”. Since ged (p^, q) = 1, it follows by Theorem B-6 that.q 
divides an. 

In exactly the same way, we see that p Jadea ao. 

The applications of this theorem can be very tedious; we have only a finite 
number of things to try, but finite numbers can be large. For example, given the 


equation 
8:3 — x? — 27 = 0, 


the only possible rational roots are the numbers 4-p/q, where q = 1, 2, 4, 8 and 
p = 1,3,9, 27. This gives 32 possibilities. On the other hand the theorem some- 
times enables us to conclude very quickly that an equation has no rational roots at 


all. Consider 
—2z 4-2. — 0, 


Here the only possible rational roots are 1, —1, 2, and —2. None of them works. 
Therefore there are no rational roots. 
Finally, let us recall one of the consequences of the factor theorem. Given a 


polynomial 
falx) = z^ + Gaya"! + +++ + aya + ag. 
If x, is a root of f,(x) = 0, then we have 
fuz) = (x — @1)fn—i(2). 
If xz is a root of f, (x) = 0, then (x — xə) is a divisor of fÍn-i(2). Thus 
Jala) = (x — zi)(x — 2)fn—a(z). 
In a finite number of such steps, we get a factorization, 
Jala) = (rx — n)(r — z2) +++ (z — za). 


Each of the numbers 2; is a root of the equation falz) = 0. And no other number 
is a root, because a product is =0 only if one of the factors is =0. (This principle 
applies, of course, to complex roots as well as real roots.) 


«—o«A———————— — 
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It is important to remember, in all of the above discussions, that the numbers 2; 
are not necessarily different. Tf we collect the repeated factors of fn(x), we get 


a factorization of the form 
fala) = (c — b)h(s — bh. @ — bm)*m. 


We then say that each b; is a root of multiplicity ki; and we observe that the sum 


of the multiplicities of the roots is the degree of the equation. 
People often refer to this fact by saying that “every algebraic equation of degree 


n has n roots.” But the latter statement, taken at face value, is silly. There are 
simple examples to show that the number of roots may be any integer from 1 to n. 
For example, the equation 710 = 0 is of degree ten, but it has only one root, 
namely, zero. 

The upper bound n, for the number of roots, 
never be alike unless they look alike. 


tells us that two polynomials can 


Theorem 5. If 
a aim Db rb mtd oo = baa” + byt" + +++ + bit + bo, 


for every x, then a; = b; for every i from 0 to n. 


Proof. Every number is a root of the equation 


(an. — bn)” + (611 — hc) cepe (aj — b1)z + (ao — bo) = 0. 
degree, so that a; = b; fori > 0. 


Therefore this equation cannot have a positive 
= bo. Therefore a; = b; for every 


The equation must therefore take the form ao 
i from 0 to n. 


EI 


LIMITS OF SEQUENCES 


THE DEFINITION OF A LIMIT FOR SEQUENCES 


Given a sequence, 
01,025, ..., 
of real numbers. When we write 
lim a, — a, 
no 
this means, roughly speaking, that when n is very large, then a, is very close to a. 
For example 


A Sam S ELT JA. 
tim (HH) = tim (1+ 3) = 1, 
and so on. 


Let us now try to frame a definition of this idea, in a sufficiently exact form to 
enable us to prove things about it. 

In the first place, when we say that a, is close to a, this means that |a, — a| is 
small. The idea, then, is that we can make |a, — a| as small as we please, merely 
by making 7 large enough. 

To say how small we want |a, — a| to be, we should name a positive number, 
say e, and then demand that 

lan — al < e. 


To explain what integers n are large enough, we should name a positive integer 
N, and require that n > N. In these terms, when we say that 
lim a, = a, 
n-» 
we are saying that no matter what number e > 0 is given, there is always an 


integer N with the property that |a, — a| < e for every n > N. This suggests 
the following definition. 


DEFINITION. lim,.,4 a; = a means that for every e > 0 there is an integer N 
such that if 
n> N, 
then 
la. — a| < e. 
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Let us try this out on some simple examples. First we shall show, using the above 
definition, that 
(1) lim, 44 1/n = 0. 
By definition, this says that 
(2) for every e > 0, there is an integer N such that if 
n>N, 


then 
| — o| « e. 
n 


The desired N is easy to find: 1 /n < e means merely that n > 1/e. By the 
Archimedean postulate, some integer N is greater than 1/e. For n 7 N, we have 


1/n < e, as desired. 
(The use of the Archimedean postulate was essential in this proof. In fact, the 


statement that lim, , 1/n = 0 is precisely equivalent to the Archimedean postu- 
late. This accounts for the fact that people who are unacquainted with the founda- 
tions of the real number system are likely to be embarrassed when asked to prove 
this very simple limit theorem.) 
Let us try the same thing for 
lim Seda 1, 


no n 
Given e > 0, we want an integer N such that if 


n>N, 


then 

BI — 1| « e 
n 

The second inequality is equivalent to 


n 
n 


We can therefore let N be any integer greater than 1/e, as before. 

An upper bound of a sequence d;, 42, . - - is simply an upper bound of the set {an}. 
A sequence which has an upper bound is said to be bounded above. A lower bound 
of a sequence aj, dg, .. . is a lower bound of the set {an}. A sequence which has 
a lower bound is said to be bounded below. A sequence is bounded if it is both 
bounded above and bounded below. This is equivalent to the statement that there 


is a number b such that |an| < b for every n. If 
a Sap $:: SM SO E 


then the sequence is called increasing. (If the strict inequality an < Gn41 always 
holds, then the sequence is called strictly increasing. But this idea does not come 
up very often.) 
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The following theorem is a consequence of the Dedekind postulate. 
Theorem 1. If a sequence is increasing, and has an upper bound, then it has a 
limit. 


Proof. Let the sequence be 


01,05,.... 


Since {an} has an upper bound, it follows by the Dedekind postulate that {an} has 
a least upper bound. Let 


a = sup {ap}. 
We shall show that | 
lim a, = a. 


no 


Let e be any positive number. Then 
ay >a—e 


for some integer N, because a — e is not an upper bound of (a4). If n > N, then 
an 2 ay, because the sequence is increasing. Thus, if 


n>N, 
then 


an > à — e. 
But a, < a + e for every n, because a is an upper bound of {an}. If 


a@—e€< a «a- e, 


then 
£6 Sty — 00x 6; 

and 

lan — al < e. 
'Thus if 

n>N 

then 

lan — al < e. | 


That is, the number N that we found is of the sort we were looking for. 
The other fundamental theorems on limits of Sequences are as follows, 


Theorem 2. If lim, ,, a, = a, and limp.» b, = b, then lim, ,, (a, + b.) = 
a+ b. 


Theorem 3. If lim, ,. an = a and lim, ,, b, = b, then lim, 0 (a, * bn) = ab. 


Theorem 4. If lim, |, an = a, and a, S k for every n, then a S k. 
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Theorem 5. If lim4.,4 à, = 4, and lim, ,4 b, = b, and a, € b, for every n, 
then a <£ b. 


Theorem 6. The Squeeze Principle. If 


a € b, S à. 
for every n, and 
lim a, = 4, 
no 
then 
lim b, = a. 
nw 


Of course the same conclusion follows if an € b, £ a for every n. We shall 


refer to both of these theorems as the squeeze principle. 
We shall not prove these theorems. Instead, we give in the following problem 


set a sequence of theorems that should lead you to the proofs by fairly easy stages. 


PROBLEM SET 


1. Show that 


lim a, = à 
nO 


if and only if 
lim (an — a) = 0. 


2. Show that if lima» an = 0 and lim, ,4 bn = 0, then lima as (an + bn) = 0. 
[Hint: You want |an + bn| < e. This will hold if the inequalities |an| < ¢/2 and |b,| < e/2 
both hold.] 

3. Prove Theorem 2. 

4. Show that if limn» a, = a, then the sequence is bounded. 

5. Show that if lima» a, = 0 and {bn} is bounded, then lim, nbn = 0. [Hint: 
If [bn] < b, and |an| < e/b, then |anbn| < eJ 

6. Show that if limas» a, = a and lima bn = b, then 


lim [an(bn — 5)] = 0. 


7. Show that if limp. a, = a and lima b, — b, then 


lim [bn(an — a) + abs — b)) = 0. 


8. Prove Theorem 3. 

9. Prove Theorem 4. [Hint: Suppose that a > k, let a — k = 
the definition of a limit.] 

10. Prove Theorem 5. 

11. Prove Theorem 6. 


e > 0, and use e in 
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32.1 FINITE AND COUNTABLE SETS 


By a segment of the integers we mean a set of the form 
I, = {1,2,3,..., n}. 


A finite sequence is a function whose domain is a set Ip. Usually we write sequences 
in the subscript notation 


01,02, . . ., Anj 


here for each 7, a; is the object which corresponds to 7 under the action of the 
function. 
Let A be a set. If there is a one-to-one correspondence 


fi:InoA 


between A and a set Zn, then we say that A is finite and has n elements, and we 
write 

Aw Is. 
In general, when we write 

A ~B, 


this means that there is a one-to-one correspondence between the set A and the 
set B. In this case, we say that A and B are equivalent sets. 

As usual, by an infinite sequence (or simply a sequence), we mean a function whose 
domain is the entire set N of natural numbers, In the subscript notation, we 
write sequences as 

01,02, ...; 


here a; is the object corresponding to the natural number 1. 

A set A is called countable if there is a sequence in which every element of A 
appears at least once. Note that since repetitions are allowed, every finite set is 
countable; we merely repeat one of its elements over and over. If 


A ^N, 


then we say that the set A is countably infinite. Thus the set N itself is countably 


infinite, ex officio; we let each natural number n correspond to itself. And the 
408 
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set Z of integers is countably infinite; we can arrange it in the sequence 
0; 4-1, 2, 2h ye es 
in which every integer appears exactly once. 
Theorem 1. Every countable set is either finite or countably infinite. 
That is, if the elements of the set A can be arranged in a sequence 
03; 04; eve Oni nil 


with repetitions allowed, then either A ~ In for some nor A ~ N. To prove this, 
we merely need to eliminate the repetitions from the sequence which is given 
by hypothesis, thus getting a sequence 


ORE ER 
yw E denda 


in which every a; appears exactly once. The method of doing this is fairly ob- 
vious. Let 


or 


b; = 0. 
bi, ba, Perey bi, 
we look to see whether every one of the a,’s has already been listed. If so, we 
have finished; n = 1, and the set A is finite. If not, we let bi+ı be the first term 


of the a-sequence which has not been listed so far. 
If this process terminates, then A is finite. If the process does not terminate, 


we get an infinite sequence 


Given 


bi, ba,- + +5 


with no repetitions. The new sequence includes all the elements of A, because 
for each n, a, is one of the objects bi, ba, ..-, bn. (Query: Under what conditions 
will a, — b, for a particular »? Under what conditions will it be true that an = bn 
for every n?] 


Theorem 2. The union of a countable collection of countable sets is countable. 


Proof. Given 
A=A UAU UAU’, 


where each A; can be arranged in a sequence 
A; Gii Gia, tiay -+ 
Let us regard the objects a;; as forming & doubly infinite array, as follows: 


Qi G2 M13.. 
a21 G22 423--- 
31 032 433--- 
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In this array, consider the diagonal sequences 


011; 
012, 021, 
013, 022, 031, 


Qin, 02,n—1; 03;n—2, » » »  Qn1 


Laying these finite sequence end to end, we get a single sequence which includes 
all of the a;;’s. 

Note that this theorem and its proof include the cases of (1) a finite collection 
of countably infinite sets, (2) a countably infinite collection of finite sets, and 
also the two other possibilities. The definition of a countable set was stated in 
such a way as to permit us to take care of all these cases at once. It is better to 
apply Theorem 1 at the times when we really need it than to worry about elimi- 
nating repetitions at most stages of most proofs. 
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1. Show that the rational numbers between 0 and 1 form a countable set. 
2. Show that the set Q of all rational numbers is a countable set. 


3. Let Eg be the set of all points (z, y) in a coordinate plane for which z and y are both 
rational. Show that Eg is countable. 


32.2 THE COUNTABILITY OF THE SURD FIELD 


We recall, from p. 228, the definition of a surd. A real number z is a surd if we 
can calculate z by a finite number of additions, subtractions, multiplications, 
divisions, and extractions of square roots, starting with 0 and 1. S is the set of 
all surds. 

We shall show, using Theorem 2 of the preceding section, that S is countable. 

Let S, be the set of all surds that can be calculated by n operations of the sort 
that are allowed. Then S, is finite for every n. The proof is by induction. 

(1) S, is finite. (Because its only elements are 0 and 1.) 

(2) If S, is finite, then so also is S, ,;. 


Proof. Let kn be the number of elements in S,. The elements of S, are the 
numbers of the form z + y, x — y, zy, z/y, x, where z and y belong to Sp. 
For each of the first four of these forms, there are surely no more than k pos- 
sibilities, because there are at most k, choices for z and kn choices for y. And 
there are at most k, possibilities for \/z. Therefore the number of elements in 
Sn+1 is surely no greater than 4k; + k, (and, in fact, it is easy to see that this 
is a gross overestimate). Therefore Sn+1 is finite. 

Since 

S= 8, USU USU, 


it follows by Theorem 2, Section 32.1, that S is countable. 
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32.3 PROOF THAT THE REAL NUMBERS ARE UNCOUNTABLE 


To show that the real numbers cannot be arranged in a sequence, we need a 


preliminary result. 
By a closed interval (of the real numbers), we mean a set 


[a,b] = {tla £ x s 9. 
By a nested sequence of closed intervals we mean a sequence 
(a1, bı), [a2; bal, - - - 
in which 
(aia, bii] C lai, bil 


for every 1. 


Theorem 1. The intersection of a nested sequence of closed intervals is not 
empty. 


‘That is, some number Z lies in every interval in the sequence. The proof is as 


follows: 
(1) Let A = (aj), and let B = {bj}. 


Then 
a; € bi 


for every i, by definition of an interval. And 
a; S aiga € bii S bi, 

because the sequence is nested. Ifi < j, then 

a; < bj, 
because 

A E E € b; 
Similarly, a; < b;if i 2 J. Therefore every b; is an upper bound of A. 
(2) Let 7 be the supremum of A, that is, the least upper bound of A. Then 


áz for every 1, 
because Z is an upper bound of A. And 

sb; for every 1, 
because z is the least upper bound of A. Therefore 


x € (ai, b] for every 1, 


which was to be proved. 
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Theorem 2. The set R of all real numbers is uncountable. 


Proof. Suppose that R can be arranged in a sequence 
X1,22,***, In eee, 


We set up a nested sequence of closed intervals, in the following way. 
(1) Let [a;, b] be any interval not containing 2}. 
(2) Given a finite sequence 


[ai bi], [a2, by], 2976 8 [as, bn), 


nested as far as it goes, such that [an, bn] contains none of the numbers 21,232, tae 
Zn. Let [an41, bn 41] be any interval which lies in [an, bn] and does not contain 
In+1- 

Now let I be the intersection of all the intervals in the sequence. Thus 


I= [a1, bi] N [a2, 65] N --- N [as ba] n - 


Then J is empty. The reason is that every real number z is supposed to be equal to 
t, for some n; and z, cannot belong to 7 , because x, does not belong to [an, bn]. 

But I cannot be empty, by Theorem 1. Thus we have a contradiction, and 
Theorem 2 must be true. 

We found in Chapter 19 that some real numbers (for example, cos 20° and 4/2) 
are not surds. The results of this section and the preceding one give an independent 
proof of the same fact: it is impossible for S and R to be the same set, because S 
is countable and R is not. In fact, since the nonsurds are more numerous, it is, 
in à way, a remarkable accident for a real number chosen at random to be a surd. 
The surds are, of course, more familiar, but this is not because they are more 
common; it is merely because they are easier to describe. 


PnoBLEM Ser 32.3 


1. In the preceding paragraph, it is stated that the nonsurds are more numerous than 
the surds. Justify this statement, by showing that R — S is uncountable. 


INDEX 


INDEX 
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AAA similarity theorem, 144 

AA similarity theorem, 145 

Abscissa, 208 

Absolute plane geometry, Chapter 10, 
122 ff. 

Absolute value, 15 

Acute, 78 

Addition, for congruence classes of 
segments, 109 

Addition theorem for are length, 270 

Addition theorem for ratios, 252 

Additivity postulate, for area, 154 

Algebraic identity, 7 

Alternate interior angles, 123 

Altitude, 147 

Angle, 55 

of a quadrilateral, 70 

Angle-addition postulate, 76 

Angle-addition theorem, 78 

Angle-construction postulate, 76 

Angle-construction theorem, 78 

Angle of parallelism, 307 

Angle-sums in a triangle, 130 

Angular measure, Chapter 5, 74 ff. 

Antipodal, 119 

Apollonius, problem of, 224 

Archimedean postulate, 25, 246, 250, 405 

Arc length, 263 

Arcs of circles, 192 

Area function, construction of, Chapter 14, 
168 ff. 

Area postulates, 153-156 

Area of à sector, 276 

Area theorems, for simple figures, 156-159 

ASA theorem, 86 7 

Associative law, generalized, 3 

Associativity, 2 


Basic similarity theorem, 138 
proved using area theory, 162-163 


Bers, Lipman, 383 
Betweenness, defined in terms of distance, 


50 
postulates, 103 
Bisector of an angle, 89 
Bolyai, 388 
Bolyai's theorem, 328 
for polygonal regions, in the hyperbolic 
case, 342 
for triangles, in the hyperbolic case, 
334 
Border triangulation, 332 


Cancellation law, of addition, 5 
of multiplication, 5 

Cartesian coordinate systems, Chapter 18, 
208 ff. 

Cartesian model, for Euclidean geometry, 
367 

Categoric postulate sets, 383 

Cavalieri’s principle, 292 

Center, 186 

Central angle, 192 

Characterization theorem, 178 

Chord, 186, 187 

Circle, 186 

Circles and spheres, Chapter 16, 186 ff. 

Circular sector, 273 

Closed triangle, 317 

Collapsible compasses, 215 

Collinear, 37 

Commutativity, 2 

Comparison theorem, 138 

Complementary, 78 

Complete in the sense'of Dedekind, 259 

Completeness, Euclidean, 25 

Complex, 169, 324 

Computing machine, 86 

Concentric, 186 

Cone, 293 


416 INDEX 
Congruence(s), for angles, defined in terms 
of distance, 77 
for circles, 189 
for segments, defined in terms of distance, 
58 
Congruence postulate(s), for angles, 104 
for area, 154 
for segments, 103 
Congruences between triangles, Chapter 6, 
81 ff. 
Conjugates, in a quadratic extension field, 
233 
Consistency of the hyperbolic postulates, 
Chapter 25, 348 ff. 
Consistent, 131 
Constructions with ruler and compass, 
Chapter 19, 214 ff. 
Continuity of the defect, 339 
Continuous functions, 287 
Convex, 61, 325 
Convex hull, 63 
Convex polygonal region, 325 
Convex quadrilaterals, 70 
Coordinate system, 49, 208 
Coplanar, 37 
Corresponding angles, 124 
Critical function, 308 
Critical number, 307 
Critically parallel, 312 
Crossbar theorem, 69 
Cross ratio, 352 
Cross sections, of cones and pyramids, 
293 
Cubic equations, 236 
Cylinder, 296 


Declaration of Independence, 382 
Dedekind cut, 261 
Dedekind postulate, 250 
Defect(s) of a polygonal region, 331 
of small triangles, 337 
of a triangle, 321 
Degree measure for angles, 74 
Descartes, 258 
Diameter, 186, 187 
Difference, of two sets, 28 
Dihedral angle, 72 
Directed angles, 57 
Directrix, 296 


Distance, formula, 209 
in the Poincaré model, 116 
postulates for, 47-49 
Distributive law, 3 
Divisor, 395 
Domain of a function, 41 
Duplication of the cube, 228, 241 


Edge, of a dihedral angle, 72 
of a region, 153 
of a triangular region, 153 
Element, of a cylinder, 296 
Empty set, 13 
Equals, 90 
Equals sign, meaning of, 3 
Equiangular, 86 
Equidistant, 176 
Equilateral, 85 
Equivalence, by finite decomposition, 327 
for rays, 319 
relation, 58 
Euclidean completeness postulate, 25 
Euclidean parallel postulate, 114 
Eudoxus, 243 
Exterior angle, 95 
Exterior angle theorem, 96 
for closed triangles, 317 
Exterior of an angle, 65 
Exterior of a circle, 186 
Exterior of a sphere, 187 


Face of a half space, 72 
Factor theorem, 401 
Fields, postulates for, 9 
Figure, 37 
Function, 40 
definition of, 46 
Fundamental theorem of algebra, 401 


Gauss, 317, 383, 401 

Generalized associative law, 29 

Geometric inequalities, synthetically con- 
sidered, 107 

Gettysburg Address, 382-383 

Graph of a condition, 210 

Great circle, 119 

Greatest common divisor, 395 


Half planes, 64 


INDEX 417 


Half spaces, 72 Logical identity, 90 
Heine, 149 Lower base, of a Saccheri quadrilateral, 125 
Hilbert, 91, 106 Lower bound of a sequence, 405 
Hyperbolie geometry, 118, Chapter 24, Lowest terms, 396 
306 ff. 

Hypotenuse-leg theorem, 150 Major are, 192 

Measure, 278 
Image, 43 for angles, 74 
Impossible construction problems, 227 of a dihedral angle, 184 
Incidence geometry, Chapter 2, 36 ff. in the sense of Jordan, 278 
Incidence theorems, 64 Measurable sets, 280 
Increasing, 405 Metrization theorem, 257 
Independence of the SAS postulate, 91 Midpoint, 60 
Independent, 131 Minor are, 192 
Induced correspondence, 82 Model, 131 
Induced isometries, 205-206 Models, 115 
Induction principle, 19, 21 Monotonic, 278 


Inequalities, 11, 12 
geometric, Chapter 7, 95 ff. 

Inner measure, 277 

Inscribed angle, 194 

Inscribed broken line, 263 


Natural numbers, 19 
Negative of a number, 2 


Integers, 22 Obtuse, 78 
Intercepted arc, 195 One-to-one correspondence, 42 
Interior, of an angle, 65 Onto, definition of, for functions, 42 
of a circle, 186 Open sentence, 12 
of a dihedral angle, 73 Open triangle, 311 
of a triangle, 67 Opposite angle, 65 
Intersection, 28 Opposite rays, 64 
Inverse of a function, 43 Opposite sides of a line, 64 
Inversions, 348 Ordered field(s), 18 
Isometry, 203 non-Archimedean, 388 
Order relations, 18 
Jefferson, Thomas, 91, 383 Ordering of the real numbers, 11 
Jordan measure in the plane, Chapter 22, Ordinate, 208 
277 ff. Origin, 208 
Outer measure, 277 
Landau, E., 35, 262 
Lebesgue, H., 290 Pappus, 86 
Limits, as the mesh approaches zero, 268 Parallel, 114 
of sequences, Chapter 31, 404 ff. Parallelism, 122 
Lincoln, Abraham, 383 for planes, 181 
Line, 36 Parallelepiped, 296 
Linear equation in z and y, 210 Parallelogram, 133 
Linear pair, 77 Parallel projections, 134 
Line-circle theorem, 189 Pasch, postulate of, 53 
Lobachevsky, 383 Perpendicular bisector, of a segment in a 
Lobachevskian parallel postulate, 114 plane, 178 


Locus, 178 of a segment in space, 178 
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Perpendicular lines and planes in space, 
Chapter 15, 176 ff. 

Perpendicular rays and lines, 78 

Perpendicularity, for planes, 185 

Perpendiculars, 93 

Pi, existence of, 265 

Plane, 37 

Plane-separation postulate, 62 

Poincaré model, 115, 348 ff. 

Point of tangency, 187 

Polygonal inequality, 124 

Polygonal region, 153 

Polynomial of degree n, 400 

Pons asinorum theorem, 85 

Prime number, 396 

Prisms, 296 

Proportional, 142 

Proportionality, constant, 142 
synthetically defined, 243-246 

Punctured plane, 348 

Pyramid, 295 

Pythagorean theorem, 148 
Euclid’s proof of, 163-164 
proved using area theory, 162-163 


Quadratie extensions, 232 
Quadrilateral, 70 


Radius, 186 
Range of a function, 41 
Rational numbers, 22 
Ratios, synthetically defined, 249-250 
Ray, 37, 55 
Real numbers, Chapter 1, 1 ff., 411 
Reciprocal, 3 
Rectangle, 125 
Redundant, 131 
Reflections, 357 
Relation, definition of, 46 
Remainder theorem, 401 
Rhombus, 133 
Riemannian parallel postulate, 115 
Right angle(s), 77 

synthetically considered, 105 
Right cylinder, 296 
Right dihedral angle, 185 
Rigid motion, Chapter 17, 202 ff. 
Root of multiplicity k, 403 
Rotation of the Cartesian model, 372 


Ruler placement theorem, 50 
Ruler postulate, 49 K 
in the Poincaré model, 360 


SAA theorem, 101 
Saccheri, 130 
Saccheri quadrilateral(s), 125, 319 
SAS postulate, 84 
in the Poincaré model, 364 
SAS similarity theorem, 146 
Scalene, 85 
S-circle, 229 
Secant line, 187 
Secant plane, 190 
Segment, 37, 54 
Segment addition, 247 
Segment-addition postulate, 103 
Segment-addition theorem, 59 
Segment-construction postulate, 103 
Segment-construction theorem, 59 
Segment-subtraction theorem, 60 
Self-evident truths, 382 
Semicircles, 192 
S-equation, 229 
Sets, 28 
countable, 408 
finite, 408 
Side(s), of an angle, 55 
of a dihedral angle, 72 
of a plane, 72 
of a quadrilateral, 70 
of a triangle, 55 
Similar, 143 
Similarities between triangles, Chapter 12, 
142 ff. 
S-line, 229 
Smaller than, for angles, 79 
Solid mensuration, Chapter 23, 291 ff. 
Solution set, 13 
Space-separation postulate, 72 
Spherical model for Riemannian geometry, 
119 
Spherical sector, 305 
Spherical segment, 305 
S-point, 229 
Square region, 155 
Squeeze principle, 407 
SSS similarity theorem, 145 
SSS theorem, 87 


Star triangulation, 325 
Strictly increasing, 405 
Subdivisions, 324 
Superposition, 366 
Supplement postulate, qu 
Supplementary angles, 77 
Supremum, 250 
Surd, circles, 231 

field, 228, 410 

field of order n, 235 

lines, 231 

of order n, 236 

plane, 230 
Synthetic postulates, 102 


Tangent line, 187 
Tangent plane, 190 


Translation of the Cartesian model, 371 


Transversal, 123 
Trapezoid, 133 
Triangle, 55 

Triangle theorem, 198 
Triangular region, 153 
Triangulations, 169, 324 
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Trisection of the angle, 228, 238 
Two-cirele theorem, 197 


Union, 28 

Unique factorization theorem, 397 
Uniqueness of hyperbolic area theory, 345 
Unit postulate, 155, 165 

Unmarked rulers, 214 

Upper base, of a Saccheri quadrilateral, 125 
Upper bound of a sequence, 405 


Vertex, of an angle, 55 

of a triangle, 55 

of a triangular region, 153 
Vertical angle theorem, 79 
Vertical pair, 79 
Vertical projection, 134 
Volume(s), 291 

of cylinders, 300 

of prisms, 298 

of pyramids and cones, 300 

of a sphere, 303 


Well-ordering principle, 21 
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